Dataset statistics
| Number of variables | 32 |
|---|---|
| Number of observations | 99441 |
| Missing cells | 17348 |
| Missing cells (%) | 0.5% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 27.1 MiB |
| Average record size in memory | 285.3 B |
Variable types
| Categorical | 16 |
|---|---|
| Numeric | 15 |
| DateTime | 1 |
order_id has a high cardinality: 99441 distinct values | High cardinality |
customer_id has a high cardinality: 99441 distinct values | High cardinality |
order_purchase_timestamp has a high cardinality: 98875 distinct values | High cardinality |
order_approved_at has a high cardinality: 90733 distinct values | High cardinality |
order_delivered_carrier_date has a high cardinality: 81018 distinct values | High cardinality |
order_delivered_customer_date has a high cardinality: 95664 distinct values | High cardinality |
order_estimated_delivery_date has a high cardinality: 459 distinct values | High cardinality |
product_most_frequent has a high cardinality: 31847 distinct values | High cardinality |
customer_unique_id has a high cardinality: 96096 distinct values | High cardinality |
customer_city has a high cardinality: 4119 distinct values | High cardinality |
product_id has a high cardinality: 31847 distinct values | High cardinality |
product_category_name_english has a high cardinality: 72 distinct values | High cardinality |
payment_value is highly overall correlated with sum_price and 2 other fields | High correlation |
sum_price is highly overall correlated with payment_value and 1 other fields | High correlation |
sum_freight_value is highly overall correlated with payment_value | High correlation |
customer_zip_code_prefix is highly overall correlated with customer_state | High correlation |
product_weight_g is highly overall correlated with payment_value and 4 other fields | High correlation |
product_length_cm is highly overall correlated with product_weight_g and 1 other fields | High correlation |
product_height_cm is highly overall correlated with product_weight_g | High correlation |
product_width_cm is highly overall correlated with product_weight_g and 1 other fields | High correlation |
customer_state is highly overall correlated with customer_zip_code_prefix | High correlation |
order_status is highly imbalanced (91.4%) | Imbalance |
payment_type is highly imbalanced (61.1%) | Imbalance |
order_delivered_carrier_date has 1783 (1.8%) missing values | Missing |
order_delivered_customer_date has 2965 (3.0%) missing values | Missing |
payment_sequential is highly skewed (γ1 = 23.94863702) | Skewed |
order_id is uniformly distributed | Uniform |
customer_id is uniformly distributed | Uniform |
order_purchase_timestamp is uniformly distributed | Uniform |
order_approved_at is uniformly distributed | Uniform |
order_delivered_carrier_date is uniformly distributed | Uniform |
order_delivered_customer_date is uniformly distributed | Uniform |
customer_unique_id is uniformly distributed | Uniform |
order_id has unique values | Unique |
customer_id has unique values | Unique |
length_comment_title has 87122 (87.6%) zeros | Zeros |
length_comment_message has 57894 (58.2%) zeros | Zeros |
product_description_lenght has 1420 (1.4%) zeros | Zeros |
product_photos_qty has 1420 (1.4%) zeros | Zeros |
Reproduction
| Analysis started | 2023-02-14 09:32:08.067965 |
|---|---|
| Analysis finished | 2023-02-14 09:33:14.292184 |
| Duration | 1 minute and 6.22 seconds |
| Software version | pandas-profiling v3.6.6 |
| Download configuration | config.json |
order_id
Categorical
HIGH CARDINALITY  UNIFORM  UNIQUE 
| Distinct | 99441 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| e481f51cbdc54678b7cc49136f2d6af7 | 1 |
|---|---|
| f01059d0d674e1282df4e8fbbe015aa2 | 1 |
| fbc17f0f2a2125054d5ac5c22d2d5120 | 1 |
| 9373150545066777b1cd2bc20e93cf8e | 1 |
| 917399e96f92268dfa2c0351b1b75fba | 1 |
| Other values (99436) |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 32 |
| Min length | 32 |
Characters and Unicode
| Total characters | 3182112 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 99441 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | e481f51cbdc54678b7cc49136f2d6af7 |
|---|---|
| 2nd row | 53cdb2fc8bc7dce0b6741e2150273451 |
| 3rd row | 47770eb9100c2d0c44946d9cf07ec65d |
| 4th row | 949d5b44dbf5de918fe9c16f97b45f8a |
| 5th row | ad21c59c0840e6cb83a9ceb5573f8159 |
Common Values
| Value | Count | Frequency (%) |
| e481f51cbdc54678b7cc49136f2d6af7 | 1 | < 0.1% |
| f01059d0d674e1282df4e8fbbe015aa2 | 1 | < 0.1% |
| fbc17f0f2a2125054d5ac5c22d2d5120 | 1 | < 0.1% |
| 9373150545066777b1cd2bc20e93cf8e | 1 | < 0.1% |
| 917399e96f92268dfa2c0351b1b75fba | 1 | < 0.1% |
| ed1691ef26bd8279bd5946561af1ff0d | 1 | < 0.1% |
| dc3006aa87f57332aaff74c57a5e094d | 1 | < 0.1% |
| f53eae1ce47dc68e8da117dd0d7feef1 | 1 | < 0.1% |
| 94bce2ab6f38b41d29ebbd9d755677bf | 1 | < 0.1% |
| 632f22d24375715fbfa8c0ae2e5d35b7 | 1 | < 0.1% |
| Other values (99431) | 99431 |
Length
| Value | Count | Frequency (%) |
| e481f51cbdc54678b7cc49136f2d6af7 | 1 | < 0.1% |
| 2ce1ad82022c1ba30c2079502ac725aa | 1 | < 0.1% |
| 949d5b44dbf5de918fe9c16f97b45f8a | 1 | < 0.1% |
| ad21c59c0840e6cb83a9ceb5573f8159 | 1 | < 0.1% |
| a4591c265e18cb1dcee52889e2d8acc3 | 1 | < 0.1% |
| 136cce7faa42fdb2cefd53fdc79a6098 | 1 | < 0.1% |
| 6514b8ad8028c9f2cc2374ded245783f | 1 | < 0.1% |
| 76c6e866289321a7c93b82b54852dc33 | 1 | < 0.1% |
| e69bfb5eb88e0ed6a785585b27e16dbf | 1 | < 0.1% |
| e6ce16cb79ec1d90b1da9085a6118aeb | 1 | < 0.1% |
| Other values (99431) | 99431 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 199814 | 6.3% |
| b | 199618 | 6.3% |
| 7 | 199334 | 6.3% |
| 6 | 199306 | 6.3% |
| e | 199225 | 6.3% |
| 2 | 199124 | 6.3% |
| 3 | 199011 | 6.3% |
| 1 | 198902 | 6.3% |
| a | 198879 | 6.2% |
| 9 | 198822 | 6.2% |
| Other values (6) | 1190077 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1989030 | |
| Lowercase Letter | 1193082 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 199814 | |
| 7 | 199334 | |
| 6 | 199306 | |
| 2 | 199124 | |
| 3 | 199011 | |
| 1 | 198902 | |
| 9 | 198822 | |
| 8 | 198629 | |
| 0 | 198434 | |
| 5 | 197654 |
Lowercase Letter
| Value | Count | Frequency (%) |
| b | 199618 | |
| e | 199225 | |
| a | 198879 | |
| f | 198774 | |
| c | 198454 | |
| d | 198132 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1989030 | |
| Latin | 1193082 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 199814 | |
| 7 | 199334 | |
| 6 | 199306 | |
| 2 | 199124 | |
| 3 | 199011 | |
| 1 | 198902 | |
| 9 | 198822 | |
| 8 | 198629 | |
| 0 | 198434 | |
| 5 | 197654 |
Latin
| Value | Count | Frequency (%) |
| b | 199618 | |
| e | 199225 | |
| a | 198879 | |
| f | 198774 | |
| c | 198454 | |
| d | 198132 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3182112 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 199814 | 6.3% |
| b | 199618 | 6.3% |
| 7 | 199334 | 6.3% |
| 6 | 199306 | 6.3% |
| e | 199225 | 6.3% |
| 2 | 199124 | 6.3% |
| 3 | 199011 | 6.3% |
| 1 | 198902 | 6.3% |
| a | 198879 | 6.2% |
| 9 | 198822 | 6.2% |
| Other values (6) | 1190077 |
customer_id
Categorical
HIGH CARDINALITY  UNIFORM  UNIQUE 
| Distinct | 99441 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| 9ef432eb6251297304e76186b10a928d | 1 |
|---|---|
| 413f7e58270a32396af030a075b924be | 1 |
| eb4350b67a0264c67e5e06a038e4afbb | 1 |
| 622b07d262d545d16efbd4363a89cb91 | 1 |
| c701fbfa77791abd05eef9eacf7ea7a8 | 1 |
| Other values (99436) |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 32 |
| Min length | 32 |
Characters and Unicode
| Total characters | 3182112 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 99441 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 9ef432eb6251297304e76186b10a928d |
|---|---|
| 2nd row | b0830fb4747a6c6d20dea0b8c802d7ef |
| 3rd row | 41ce2a54c0b03bf3443c3d931a367089 |
| 4th row | f88197465ea7920adcdbec7375364d82 |
| 5th row | 8ab97904e6daea8866dbdbc4fb7aad2c |
Common Values
| Value | Count | Frequency (%) |
| 9ef432eb6251297304e76186b10a928d | 1 | < 0.1% |
| 413f7e58270a32396af030a075b924be | 1 | < 0.1% |
| eb4350b67a0264c67e5e06a038e4afbb | 1 | < 0.1% |
| 622b07d262d545d16efbd4363a89cb91 | 1 | < 0.1% |
| c701fbfa77791abd05eef9eacf7ea7a8 | 1 | < 0.1% |
| 99ce553a3ac79b26416f2adca143760e | 1 | < 0.1% |
| 50900ea3519ead20da341b41081736e9 | 1 | < 0.1% |
| a4fe94a051d268fbbe8e4ca932ebc460 | 1 | < 0.1% |
| ba712872211b52224c61d5bedfc1bfcf | 1 | < 0.1% |
| f8b67d327058afa39382991d7173b1d7 | 1 | < 0.1% |
| Other values (99431) | 99431 |
Length
| Value | Count | Frequency (%) |
| 9ef432eb6251297304e76186b10a928d | 1 | < 0.1% |
| 7f2178c5d771e17f507d3c1637339298 | 1 | < 0.1% |
| f88197465ea7920adcdbec7375364d82 | 1 | < 0.1% |
| 8ab97904e6daea8866dbdbc4fb7aad2c | 1 | < 0.1% |
| 503740e9ca751ccdda7ba28e9ab8f608 | 1 | < 0.1% |
| ed0271e0b7da060a393796590e7b737a | 1 | < 0.1% |
| 9bdf08b4b3b52b5526ff42d37d47f222 | 1 | < 0.1% |
| f54a9f0e6b351c431402b8461ea51999 | 1 | < 0.1% |
| 31ad1d1b63eb9962463f764d4e6e0c9d | 1 | < 0.1% |
| 494dded5b201313c64ed7f100595b95c | 1 | < 0.1% |
| Other values (99431) | 99431 |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 199366 | 6.3% |
| f | 199255 | 6.3% |
| 2 | 199235 | 6.3% |
| c | 199193 | 6.3% |
| 1 | 199150 | 6.3% |
| b | 199137 | 6.3% |
| 8 | 199094 | 6.3% |
| 3 | 199061 | 6.3% |
| 7 | 198923 | 6.3% |
| 6 | 198760 | 6.2% |
| Other values (6) | 1190938 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1988533 | |
| Lowercase Letter | 1193579 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 199366 | |
| 2 | 199235 | |
| 1 | 199150 | |
| 8 | 199094 | |
| 3 | 199061 | |
| 7 | 198923 | |
| 6 | 198760 | |
| 9 | 198689 | |
| 0 | 198310 | |
| 4 | 197945 |
Lowercase Letter
| Value | Count | Frequency (%) |
| f | 199255 | |
| c | 199193 | |
| b | 199137 | |
| e | 198713 | |
| a | 198646 | |
| d | 198635 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1988533 | |
| Latin | 1193579 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 199366 | |
| 2 | 199235 | |
| 1 | 199150 | |
| 8 | 199094 | |
| 3 | 199061 | |
| 7 | 198923 | |
| 6 | 198760 | |
| 9 | 198689 | |
| 0 | 198310 | |
| 4 | 197945 |
Latin
| Value | Count | Frequency (%) |
| f | 199255 | |
| c | 199193 | |
| b | 199137 | |
| e | 198713 | |
| a | 198646 | |
| d | 198635 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3182112 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 199366 | 6.3% |
| f | 199255 | 6.3% |
| 2 | 199235 | 6.3% |
| c | 199193 | 6.3% |
| 1 | 199150 | 6.3% |
| b | 199137 | 6.3% |
| 8 | 199094 | 6.3% |
| 3 | 199061 | 6.3% |
| 7 | 198923 | 6.3% |
| 6 | 198760 | 6.2% |
| Other values (6) | 1190938 |
order_status
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| delivered | |
|---|---|
| shipped | 1107 |
| canceled | 625 |
| unavailable | 609 |
| invoiced | 314 |
| Other values (3) | 308 |
Length
| Max length | 11 |
|---|---|
| Median length | 9 |
| Mean length | 8.9834475 |
| Min length | 7 |
Characters and Unicode
| Total characters | 893323 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | delivered |
|---|---|
| 2nd row | delivered |
| 3rd row | delivered |
| 4th row | delivered |
| 5th row | delivered |
Common Values
| Value | Count | Frequency (%) |
| delivered | 96478 | |
| shipped | 1107 | 1.1% |
| canceled | 625 | 0.6% |
| unavailable | 609 | 0.6% |
| invoiced | 314 | 0.3% |
| processing | 301 | 0.3% |
| created | 5 | < 0.1% |
| approved | 2 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| delivered | 96478 | |
| shipped | 1107 | 1.1% |
| canceled | 625 | 0.6% |
| unavailable | 609 | 0.6% |
| invoiced | 314 | 0.3% |
| processing | 301 | 0.3% |
| created | 5 | < 0.1% |
| approved | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 293027 | |
| d | 195009 | |
| i | 99123 | 11.1% |
| l | 98321 | 11.0% |
| v | 97403 | 10.9% |
| r | 96786 | 10.8% |
| p | 2519 | 0.3% |
| a | 2459 | 0.3% |
| c | 1870 | 0.2% |
| n | 1849 | 0.2% |
| Other values (7) | 4957 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 893323 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 293027 | |
| d | 195009 | |
| i | 99123 | 11.1% |
| l | 98321 | 11.0% |
| v | 97403 | 10.9% |
| r | 96786 | 10.8% |
| p | 2519 | 0.3% |
| a | 2459 | 0.3% |
| c | 1870 | 0.2% |
| n | 1849 | 0.2% |
| Other values (7) | 4957 | 0.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 893323 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 293027 | |
| d | 195009 | |
| i | 99123 | 11.1% |
| l | 98321 | 11.0% |
| v | 97403 | 10.9% |
| r | 96786 | 10.8% |
| p | 2519 | 0.3% |
| a | 2459 | 0.3% |
| c | 1870 | 0.2% |
| n | 1849 | 0.2% |
| Other values (7) | 4957 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 893323 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 293027 | |
| d | 195009 | |
| i | 99123 | 11.1% |
| l | 98321 | 11.0% |
| v | 97403 | 10.9% |
| r | 96786 | 10.8% |
| p | 2519 | 0.3% |
| a | 2459 | 0.3% |
| c | 1870 | 0.2% |
| n | 1849 | 0.2% |
| Other values (7) | 4957 | 0.6% |
order_purchase_timestamp
Categorical
HIGH CARDINALITY  UNIFORM 
| Distinct | 98875 |
|---|---|
| Distinct (%) | 99.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| 2018-04-11 10:48:14 | 3 |
|---|---|
| 2018-07-28 13:11:22 | 3 |
| 2017-11-20 10:59:08 | 3 |
| 2018-08-02 12:05:26 | 3 |
| 2018-08-02 12:06:09 | 3 |
| Other values (98870) |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Characters and Unicode
| Total characters | 1889379 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 4 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 98319 ? |
|---|---|
| Unique (%) | 98.9% |
Sample
| 1st row | 2017-10-02 10:56:33 |
|---|---|
| 2nd row | 2018-07-24 20:41:37 |
| 3rd row | 2018-08-08 08:38:49 |
| 4th row | 2017-11-18 19:28:06 |
| 5th row | 2018-02-13 21:18:39 |
Common Values
| Value | Count | Frequency (%) |
| 2018-04-11 10:48:14 | 3 | < 0.1% |
| 2018-07-28 13:11:22 | 3 | < 0.1% |
| 2017-11-20 10:59:08 | 3 | < 0.1% |
| 2018-08-02 12:05:26 | 3 | < 0.1% |
| 2018-08-02 12:06:09 | 3 | < 0.1% |
| 2018-06-01 13:39:44 | 3 | < 0.1% |
| 2018-03-31 15:08:21 | 3 | < 0.1% |
| 2018-02-19 15:37:47 | 3 | < 0.1% |
| 2018-08-02 12:06:07 | 3 | < 0.1% |
| 2017-11-20 11:46:30 | 3 | < 0.1% |
| Other values (98865) | 99411 |
Length
| Value | Count | Frequency (%) |
| 2017-11-24 | 1176 | 0.6% |
| 2017-11-25 | 499 | 0.3% |
| 2017-11-27 | 403 | 0.2% |
| 2017-11-26 | 391 | 0.2% |
| 2017-11-28 | 380 | 0.2% |
| 2018-05-07 | 372 | 0.2% |
| 2018-08-06 | 372 | 0.2% |
| 2018-08-07 | 370 | 0.2% |
| 2018-05-14 | 364 | 0.2% |
| 2018-05-16 | 357 | 0.2% |
| Other values (51442) | 194198 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 307586 | |
| 0 | 306287 | |
| 2 | 242536 | |
| - | 198882 | |
| : | 198882 | |
| 8 | 103570 | 5.5% |
| 99441 | 5.3% | |
| 7 | 92231 | 4.9% |
| 3 | 87960 | 4.7% |
| 4 | 80406 | 4.3% |
| Other values (3) | 171598 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1392174 | |
| Dash Punctuation | 198882 | 10.5% |
| Other Punctuation | 198882 | 10.5% |
| Space Separator | 99441 | 5.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 307586 | |
| 0 | 306287 | |
| 2 | 242536 | |
| 8 | 103570 | 7.4% |
| 7 | 92231 | 6.6% |
| 3 | 87960 | 6.3% |
| 4 | 80406 | 5.8% |
| 5 | 80169 | 5.8% |
| 6 | 47041 | 3.4% |
| 9 | 44388 | 3.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 198882 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 198882 |
Space Separator
| Value | Count | Frequency (%) |
| 99441 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1889379 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 307586 | |
| 0 | 306287 | |
| 2 | 242536 | |
| - | 198882 | |
| : | 198882 | |
| 8 | 103570 | 5.5% |
| 99441 | 5.3% | |
| 7 | 92231 | 4.9% |
| 3 | 87960 | 4.7% |
| 4 | 80406 | 4.3% |
| Other values (3) | 171598 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1889379 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 307586 | |
| 0 | 306287 | |
| 2 | 242536 | |
| - | 198882 | |
| : | 198882 | |
| 8 | 103570 | 5.5% |
| 99441 | 5.3% | |
| 7 | 92231 | 4.9% |
| 3 | 87960 | 4.7% |
| 4 | 80406 | 4.3% |
| Other values (3) | 171598 |
order_approved_at
Categorical
HIGH CARDINALITY  UNIFORM 
| Distinct | 90733 |
|---|---|
| Distinct (%) | 91.4% |
| Missing | 160 |
| Missing (%) | 0.2% |
| Memory size | 1.5 MiB |
| 2018-02-27 04:31:10 | 9 |
|---|---|
| 2018-02-06 05:31:52 | 7 |
| 2017-11-07 07:30:38 | 7 |
| 2017-12-05 10:30:42 | 7 |
| 2018-07-05 16:33:01 | 7 |
| Other values (90728) |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Characters and Unicode
| Total characters | 1886339 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 4 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 83688 ? |
|---|---|
| Unique (%) | 84.3% |
Sample
| 1st row | 2017-10-02 11:07:15 |
|---|---|
| 2nd row | 2018-07-26 03:24:27 |
| 3rd row | 2018-08-08 08:55:23 |
| 4th row | 2017-11-18 19:45:59 |
| 5th row | 2018-02-13 22:20:29 |
Common Values
| Value | Count | Frequency (%) |
| 2018-02-27 04:31:10 | 9 | < 0.1% |
| 2018-02-06 05:31:52 | 7 | < 0.1% |
| 2017-11-07 07:30:38 | 7 | < 0.1% |
| 2017-12-05 10:30:42 | 7 | < 0.1% |
| 2018-07-05 16:33:01 | 7 | < 0.1% |
| 2017-11-07 07:30:29 | 7 | < 0.1% |
| 2018-01-10 10:32:03 | 7 | < 0.1% |
| 2018-02-27 04:31:01 | 7 | < 0.1% |
| 2018-07-23 12:32:17 | 6 | < 0.1% |
| 2018-03-27 04:08:34 | 6 | < 0.1% |
| Other values (90723) | 99211 | |
| (Missing) | 160 | 0.2% |
Length
| Value | Count | Frequency (%) |
| 2018-04-24 | 990 | 0.5% |
| 2017-11-24 | 799 | 0.4% |
| 2017-11-25 | 754 | 0.4% |
| 2018-07-05 | 697 | 0.4% |
| 2017-11-28 | 506 | 0.3% |
| 2018-08-07 | 444 | 0.2% |
| 2018-05-08 | 426 | 0.2% |
| 2018-08-20 | 426 | 0.2% |
| 2017-12-05 | 426 | 0.2% |
| 2018-01-22 | 408 | 0.2% |
| Other values (42347) | 192686 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 318624 | |
| 1 | 305309 | |
| 2 | 241070 | |
| - | 198562 | |
| : | 198562 | |
| 99281 | 5.3% | |
| 8 | 98283 | 5.2% |
| 5 | 95612 | 5.1% |
| 3 | 93087 | 4.9% |
| 7 | 87979 | 4.7% |
| Other values (3) | 149970 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1389934 | |
| Dash Punctuation | 198562 | 10.5% |
| Other Punctuation | 198562 | 10.5% |
| Space Separator | 99281 | 5.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 318624 | |
| 1 | 305309 | |
| 2 | 241070 | |
| 8 | 98283 | 7.1% |
| 5 | 95612 | 6.9% |
| 3 | 93087 | 6.7% |
| 7 | 87979 | 6.3% |
| 4 | 68887 | 5.0% |
| 6 | 42734 | 3.1% |
| 9 | 38349 | 2.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 198562 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 198562 |
Space Separator
| Value | Count | Frequency (%) |
| 99281 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1886339 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 318624 | |
| 1 | 305309 | |
| 2 | 241070 | |
| - | 198562 | |
| : | 198562 | |
| 99281 | 5.3% | |
| 8 | 98283 | 5.2% |
| 5 | 95612 | 5.1% |
| 3 | 93087 | 4.9% |
| 7 | 87979 | 4.7% |
| Other values (3) | 149970 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1886339 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 318624 | |
| 1 | 305309 | |
| 2 | 241070 | |
| - | 198562 | |
| : | 198562 | |
| 99281 | 5.3% | |
| 8 | 98283 | 5.2% |
| 5 | 95612 | 5.1% |
| 3 | 93087 | 4.9% |
| 7 | 87979 | 4.7% |
| Other values (3) | 149970 |
order_delivered_carrier_date
Categorical
HIGH CARDINALITY  MISSING  UNIFORM 
| Distinct | 81018 |
|---|---|
| Distinct (%) | 83.0% |
| Missing | 1783 |
| Missing (%) | 1.8% |
| Memory size | 1.5 MiB |
| 2018-05-09 15:48:00 | 47 |
|---|---|
| 2018-05-10 18:29:00 | 32 |
| 2018-05-07 12:31:00 | 21 |
| 2018-07-24 16:07:00 | 16 |
| 2018-05-02 15:15:00 | 16 |
| Other values (81013) |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Characters and Unicode
| Total characters | 1855502 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 4 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 70926 ? |
|---|---|
| Unique (%) | 72.6% |
Sample
| 1st row | 2017-10-04 19:55:00 |
|---|---|
| 2nd row | 2018-07-26 14:31:00 |
| 3rd row | 2018-08-08 13:50:00 |
| 4th row | 2017-11-22 13:39:59 |
| 5th row | 2018-02-14 19:46:34 |
Common Values
| Value | Count | Frequency (%) |
| 2018-05-09 15:48:00 | 47 | < 0.1% |
| 2018-05-10 18:29:00 | 32 | < 0.1% |
| 2018-05-07 12:31:00 | 21 | < 0.1% |
| 2018-07-24 16:07:00 | 16 | < 0.1% |
| 2018-05-02 15:15:00 | 16 | < 0.1% |
| 2018-07-17 14:16:00 | 15 | < 0.1% |
| 2018-05-16 13:44:00 | 15 | < 0.1% |
| 2018-08-03 15:10:00 | 15 | < 0.1% |
| 2018-08-08 15:01:00 | 15 | < 0.1% |
| 2018-05-17 15:06:00 | 14 | < 0.1% |
| Other values (81008) | 97452 | |
| (Missing) | 1783 | 1.8% |
Length
| Value | Count | Frequency (%) |
| 2017-11-28 | 707 | 0.4% |
| 2017-11-27 | 673 | 0.3% |
| 2017-11-29 | 566 | 0.3% |
| 2018-02-27 | 523 | 0.3% |
| 2018-03-27 | 511 | 0.3% |
| 2018-08-06 | 510 | 0.3% |
| 2017-11-30 | 489 | 0.3% |
| 2018-08-13 | 472 | 0.2% |
| 2018-05-15 | 451 | 0.2% |
| 2018-05-03 | 450 | 0.2% |
| Other values (37539) | 189964 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 338915 | |
| 1 | 288924 | |
| 2 | 230289 | |
| - | 195316 | |
| : | 195316 | |
| 8 | 103165 | 5.6% |
| 97658 | 5.3% | |
| 7 | 88752 | 4.8% |
| 3 | 81979 | 4.4% |
| 4 | 77011 | 4.2% |
| Other values (3) | 158177 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1367212 | |
| Dash Punctuation | 195316 | 10.5% |
| Other Punctuation | 195316 | 10.5% |
| Space Separator | 97658 | 5.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 338915 | |
| 1 | 288924 | |
| 2 | 230289 | |
| 8 | 103165 | 7.5% |
| 7 | 88752 | 6.5% |
| 3 | 81979 | 6.0% |
| 4 | 77011 | 5.6% |
| 5 | 74722 | 5.5% |
| 6 | 42928 | 3.1% |
| 9 | 40527 | 3.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 195316 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 195316 |
Space Separator
| Value | Count | Frequency (%) |
| 97658 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1855502 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 338915 | |
| 1 | 288924 | |
| 2 | 230289 | |
| - | 195316 | |
| : | 195316 | |
| 8 | 103165 | 5.6% |
| 97658 | 5.3% | |
| 7 | 88752 | 4.8% |
| 3 | 81979 | 4.4% |
| 4 | 77011 | 4.2% |
| Other values (3) | 158177 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1855502 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 338915 | |
| 1 | 288924 | |
| 2 | 230289 | |
| - | 195316 | |
| : | 195316 | |
| 8 | 103165 | 5.6% |
| 97658 | 5.3% | |
| 7 | 88752 | 4.8% |
| 3 | 81979 | 4.4% |
| 4 | 77011 | 4.2% |
| Other values (3) | 158177 |
order_delivered_customer_date
Categorical
HIGH CARDINALITY  MISSING  UNIFORM 
| Distinct | 95664 |
|---|---|
| Distinct (%) | 99.2% |
| Missing | 2965 |
| Missing (%) | 3.0% |
| Memory size | 1.5 MiB |
| 2018-07-24 21:36:42 | 3 |
|---|---|
| 2018-05-08 23:38:46 | 3 |
| 2018-05-14 20:02:44 | 3 |
| 2017-12-02 00:26:45 | 3 |
| 2018-05-08 19:36:48 | 3 |
| Other values (95659) |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Characters and Unicode
| Total characters | 1833044 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 4 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 94860 ? |
|---|---|
| Unique (%) | 98.3% |
Sample
| 1st row | 2017-10-10 21:25:13 |
|---|---|
| 2nd row | 2018-08-07 15:27:45 |
| 3rd row | 2018-08-17 18:06:29 |
| 4th row | 2017-12-02 00:28:42 |
| 5th row | 2018-02-16 18:17:02 |
Common Values
| Value | Count | Frequency (%) |
| 2018-07-24 21:36:42 | 3 | < 0.1% |
| 2018-05-08 23:38:46 | 3 | < 0.1% |
| 2018-05-14 20:02:44 | 3 | < 0.1% |
| 2017-12-02 00:26:45 | 3 | < 0.1% |
| 2018-05-08 19:36:48 | 3 | < 0.1% |
| 2016-10-27 17:32:07 | 3 | < 0.1% |
| 2018-02-14 21:09:19 | 3 | < 0.1% |
| 2017-06-19 18:47:51 | 3 | < 0.1% |
| 2018-03-16 13:28:28 | 2 | < 0.1% |
| 2018-07-06 16:32:39 | 2 | < 0.1% |
| Other values (95654) | 96448 | |
| (Missing) | 2965 | 3.0% |
Length
| Value | Count | Frequency (%) |
| 2018-08-27 | 446 | 0.2% |
| 2018-08-13 | 442 | 0.2% |
| 2018-05-14 | 434 | 0.2% |
| 2018-05-21 | 431 | 0.2% |
| 2018-05-18 | 425 | 0.2% |
| 2018-04-11 | 413 | 0.2% |
| 2017-12-11 | 412 | 0.2% |
| 2018-07-03 | 410 | 0.2% |
| 2018-05-03 | 409 | 0.2% |
| 2017-06-19 | 405 | 0.2% |
| Other values (41734) | 188725 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 282611 | |
| 0 | 281259 | |
| 2 | 243609 | |
| - | 192952 | |
| : | 192952 | |
| 8 | 113512 | |
| 96476 | 5.3% | |
| 3 | 89135 | 4.9% |
| 7 | 88999 | 4.9% |
| 4 | 83444 | 4.6% |
| Other values (3) | 168095 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1350664 | |
| Dash Punctuation | 192952 | 10.5% |
| Other Punctuation | 192952 | 10.5% |
| Space Separator | 96476 | 5.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 282611 | |
| 0 | 281259 | |
| 2 | 243609 | |
| 8 | 113512 | |
| 3 | 89135 | 6.6% |
| 7 | 88999 | 6.6% |
| 4 | 83444 | 6.2% |
| 5 | 78170 | 5.8% |
| 6 | 48254 | 3.6% |
| 9 | 41671 | 3.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 192952 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 192952 |
Space Separator
| Value | Count | Frequency (%) |
| 96476 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1833044 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 282611 | |
| 0 | 281259 | |
| 2 | 243609 | |
| - | 192952 | |
| : | 192952 | |
| 8 | 113512 | |
| 96476 | 5.3% | |
| 3 | 89135 | 4.9% |
| 7 | 88999 | 4.9% |
| 4 | 83444 | 4.6% |
| Other values (3) | 168095 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1833044 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 282611 | |
| 0 | 281259 | |
| 2 | 243609 | |
| - | 192952 | |
| : | 192952 | |
| 8 | 113512 | |
| 96476 | 5.3% | |
| 3 | 89135 | 4.9% |
| 7 | 88999 | 4.9% |
| 4 | 83444 | 4.6% |
| Other values (3) | 168095 |
order_estimated_delivery_date
Categorical
| Distinct | 459 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| 2017-12-20 00:00:00 | 522 |
|---|---|
| 2018-03-12 00:00:00 | 516 |
| 2018-05-29 00:00:00 | 513 |
| 2018-03-13 00:00:00 | 513 |
| 2018-02-14 00:00:00 | 507 |
| Other values (454) |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Characters and Unicode
| Total characters | 1889379 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 4 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 21 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2017-10-18 00:00:00 |
|---|---|
| 2nd row | 2018-08-13 00:00:00 |
| 3rd row | 2018-09-04 00:00:00 |
| 4th row | 2017-12-15 00:00:00 |
| 5th row | 2018-02-26 00:00:00 |
Common Values
| Value | Count | Frequency (%) |
| 2017-12-20 00:00:00 | 522 | 0.5% |
| 2018-03-12 00:00:00 | 516 | 0.5% |
| 2018-05-29 00:00:00 | 513 | 0.5% |
| 2018-03-13 00:00:00 | 513 | 0.5% |
| 2018-02-14 00:00:00 | 507 | 0.5% |
| 2017-12-18 00:00:00 | 493 | 0.5% |
| 2018-05-28 00:00:00 | 492 | 0.5% |
| 2018-03-06 00:00:00 | 492 | 0.5% |
| 2018-02-06 00:00:00 | 491 | 0.5% |
| 2018-04-12 00:00:00 | 490 | 0.5% |
| Other values (449) | 94412 |
Length
| Value | Count | Frequency (%) |
| 00:00:00 | 99441 | |
| 2017-12-20 | 522 | 0.3% |
| 2018-03-12 | 516 | 0.3% |
| 2018-05-29 | 513 | 0.3% |
| 2018-03-13 | 513 | 0.3% |
| 2018-02-14 | 507 | 0.3% |
| 2017-12-18 | 493 | 0.2% |
| 2018-05-28 | 492 | 0.2% |
| 2018-03-06 | 492 | 0.2% |
| 2018-02-06 | 491 | 0.2% |
| Other values (450) | 94902 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 822625 | |
| - | 198882 | 10.5% |
| : | 198882 | 10.5% |
| 1 | 170040 | 9.0% |
| 2 | 155085 | 8.2% |
| 99441 | 5.3% | |
| 8 | 82640 | 4.4% |
| 7 | 60296 | 3.2% |
| 3 | 26615 | 1.4% |
| 5 | 20381 | 1.1% |
| Other values (3) | 54492 | 2.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1392174 | |
| Dash Punctuation | 198882 | 10.5% |
| Other Punctuation | 198882 | 10.5% |
| Space Separator | 99441 | 5.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 822625 | |
| 1 | 170040 | 12.2% |
| 2 | 155085 | 11.1% |
| 8 | 82640 | 5.9% |
| 7 | 60296 | 4.3% |
| 3 | 26615 | 1.9% |
| 5 | 20381 | 1.5% |
| 6 | 19355 | 1.4% |
| 4 | 18619 | 1.3% |
| 9 | 16518 | 1.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 198882 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 198882 |
Space Separator
| Value | Count | Frequency (%) |
| 99441 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1889379 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 822625 | |
| - | 198882 | 10.5% |
| : | 198882 | 10.5% |
| 1 | 170040 | 9.0% |
| 2 | 155085 | 8.2% |
| 99441 | 5.3% | |
| 8 | 82640 | 4.4% |
| 7 | 60296 | 3.2% |
| 3 | 26615 | 1.4% |
| 5 | 20381 | 1.1% |
| Other values (3) | 54492 | 2.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1889379 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 822625 | |
| - | 198882 | 10.5% |
| : | 198882 | 10.5% |
| 1 | 170040 | 9.0% |
| 2 | 155085 | 8.2% |
| 99441 | 5.3% | |
| 8 | 82640 | 4.4% |
| 7 | 60296 | 3.2% |
| 3 | 26615 | 1.4% |
| 5 | 20381 | 1.1% |
| Other values (3) | 54492 | 2.9% |
review_score
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 768 |
| Missing (%) | 0.8% |
| Memory size | 1.5 MiB |
| 5.0 | |
|---|---|
| 4.0 | |
| 1.0 | |
| 3.0 | |
| 2.0 | 3131 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 296019 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 4.0 |
|---|---|
| 2nd row | 4.0 |
| 3rd row | 5.0 |
| 4th row | 5.0 |
| 5th row | 5.0 |
Common Values
| Value | Count | Frequency (%) |
| 5.0 | 57007 | |
| 4.0 | 19038 | 19.1% |
| 1.0 | 11363 | 11.4% |
| 3.0 | 8134 | 8.2% |
| 2.0 | 3131 | 3.1% |
| (Missing) | 768 | 0.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 5.0 | 57007 | |
| 4.0 | 19038 | 19.3% |
| 1.0 | 11363 | 11.5% |
| 3.0 | 8134 | 8.2% |
| 2.0 | 3131 | 3.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 98673 | |
| 0 | 98673 | |
| 5 | 57007 | |
| 4 | 19038 | 6.4% |
| 1 | 11363 | 3.8% |
| 3 | 8134 | 2.7% |
| 2 | 3131 | 1.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 197346 | |
| Other Punctuation | 98673 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 98673 | |
| 5 | 57007 | |
| 4 | 19038 | 9.6% |
| 1 | 11363 | 5.8% |
| 3 | 8134 | 4.1% |
| 2 | 3131 | 1.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 98673 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 296019 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 98673 | |
| 0 | 98673 | |
| 5 | 57007 | |
| 4 | 19038 | 6.4% |
| 1 | 11363 | 3.8% |
| 3 | 8134 | 2.7% |
| 2 | 3131 | 1.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 296019 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 98673 | |
| 0 | 98673 | |
| 5 | 57007 | |
| 4 | 19038 | 6.4% |
| 1 | 11363 | 3.8% |
| 3 | 8134 | 2.7% |
| 2 | 3131 | 1.1% |
length_comment_title
Real number (ℝ)
| Distinct | 27 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 768 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.3783608 |
| Minimum | 0 |
|---|---|
| Maximum | 26 |
| Zeros | 87122 |
| Zeros (%) | 87.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 12 |
| Maximum | 26 |
| Range | 26 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 4.3584435 |
|---|---|
| Coefficient of variation (CV) | 3.1620483 |
| Kurtosis | 11.679804 |
| Mean | 1.3783608 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.4581046 |
| Sum | 136007 |
| Variance | 18.99603 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 87122 | |
| 9 | 2000 | 2.0% |
| 5 | 1117 | 1.1% |
| 15 | 870 | 0.9% |
| 3 | 703 | 0.7% |
| 10 | 575 | 0.6% |
| 13 | 489 | 0.5% |
| 17 | 482 | 0.5% |
| 25 | 429 | 0.4% |
| 14 | 399 | 0.4% |
| Other values (17) | 4487 | 4.5% |
| (Missing) | 768 | 0.8% |
| Value | Count | Frequency (%) |
| 0 | 87122 | |
| 1 | 164 | 0.2% |
| 2 | 253 | 0.3% |
| 3 | 703 | 0.7% |
| 4 | 178 | 0.2% |
| 5 | 1117 | 1.1% |
| 6 | 243 | 0.2% |
| 7 | 388 | 0.4% |
| 8 | 342 | 0.3% |
| 9 | 2000 | 2.0% |
| Value | Count | Frequency (%) |
| 26 | 1 | < 0.1% |
| 25 | 429 | |
| 24 | 221 | |
| 23 | 213 | |
| 22 | 213 | |
| 21 | 239 | |
| 20 | 390 | |
| 19 | 268 | |
| 18 | 301 | |
| 17 | 482 |
length_comment_message
Real number (ℝ)
| Distinct | 209 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 768 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 28.274888 |
| Minimum | 0 |
|---|---|
| Maximum | 208 |
| Zeros | 57894 |
| Zeros (%) | 58.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 42 |
| 95-th percentile | 146 |
| Maximum | 208 |
| Range | 208 |
| Interquartile range (IQR) | 42 |
Descriptive statistics
| Standard deviation | 48.294605 |
|---|---|
| Coefficient of variation (CV) | 1.7080388 |
| Kurtosis | 3.3124722 |
| Mean | 28.274888 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.9820518 |
| Sum | 2789968 |
| Variance | 2332.3689 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 57894 | |
| 9 | 1006 | 1.0% |
| 200 | 594 | 0.6% |
| 5 | 558 | 0.6% |
| 3 | 514 | 0.5% |
| 26 | 503 | 0.5% |
| 20 | 475 | 0.5% |
| 10 | 469 | 0.5% |
| 34 | 464 | 0.5% |
| 31 | 451 | 0.5% |
| Other values (199) | 35745 | |
| (Missing) | 768 | 0.8% |
| Value | Count | Frequency (%) |
| 0 | 57894 | |
| 1 | 97 | 0.1% |
| 2 | 197 | 0.2% |
| 3 | 514 | 0.5% |
| 4 | 101 | 0.1% |
| 5 | 558 | 0.6% |
| 6 | 205 | 0.2% |
| 7 | 236 | 0.2% |
| 8 | 244 | 0.2% |
| 9 | 1006 | 1.0% |
| Value | Count | Frequency (%) |
| 208 | 1 | < 0.1% |
| 207 | 1 | < 0.1% |
| 206 | 1 | < 0.1% |
| 205 | 1 | < 0.1% |
| 204 | 15 | < 0.1% |
| 203 | 17 | < 0.1% |
| 202 | 12 | < 0.1% |
| 201 | 22 | < 0.1% |
| 200 | 594 | |
| 199 | 334 |
| Distinct | 97966 |
|---|---|
| Distinct (%) | 99.3% |
| Missing | 768 |
| Missing (%) | 0.8% |
| Memory size | 1.5 MiB |
| Minimum | 2016-10-07 18:32:28 |
|---|---|
| Maximum | 2018-10-29 12:27:35 |
payment_type
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 1.5 MiB |
| credit_card | |
|---|---|
| boleto | |
| credit_card,voucher | 2245 |
| voucher | 1621 |
| debit_card | 1527 |
| Other values (2) | 4 |
Length
| Max length | 22 |
|---|---|
| Median length | 11 |
| Mean length | 10.10539 |
| Min length | 6 |
Characters and Unicode
| Total characters | 1004880 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | credit_card,voucher |
|---|---|
| 2nd row | boleto |
| 3rd row | credit_card |
| 4th row | credit_card |
| 5th row | credit_card |
Common Values
| Value | Count | Frequency (%) |
| credit_card | 74259 | |
| boleto | 19784 | 19.9% |
| credit_card,voucher | 2245 | 2.3% |
| voucher | 1621 | 1.6% |
| debit_card | 1527 | 1.5% |
| not_defined | 3 | < 0.1% |
| credit_card,debit_card | 1 | < 0.1% |
| (Missing) | 1 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| credit_card | 74259 | |
| boleto | 19784 | 19.9% |
| credit_card,voucher | 2245 | 2.3% |
| voucher | 1621 | 1.6% |
| debit_card | 1527 | 1.5% |
| not_defined | 3 | < 0.1% |
| credit_card,debit_card | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 158404 | |
| r | 158404 | |
| d | 156072 | |
| e | 101689 | |
| t | 97820 | |
| i | 78036 | |
| _ | 78036 | |
| a | 78033 | |
| o | 43437 | 4.3% |
| b | 21312 | 2.1% |
| Other values (7) | 33637 | 3.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 924598 | |
| Connector Punctuation | 78036 | 7.8% |
| Other Punctuation | 2246 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 158404 | |
| r | 158404 | |
| d | 156072 | |
| e | 101689 | |
| t | 97820 | |
| i | 78036 | |
| a | 78033 | |
| o | 43437 | 4.7% |
| b | 21312 | 2.3% |
| l | 19784 | 2.1% |
| Other values (5) | 11607 | 1.3% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 78036 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 2246 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 924598 | |
| Common | 80282 | 8.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| c | 158404 | |
| r | 158404 | |
| d | 156072 | |
| e | 101689 | |
| t | 97820 | |
| i | 78036 | |
| a | 78033 | |
| o | 43437 | 4.7% |
| b | 21312 | 2.3% |
| l | 19784 | 2.1% |
| Other values (5) | 11607 | 1.3% |
Common
| Value | Count | Frequency (%) |
| _ | 78036 | |
| , | 2246 | 2.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1004880 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 158404 | |
| r | 158404 | |
| d | 156072 | |
| e | 101689 | |
| t | 97820 | |
| i | 78036 | |
| _ | 78036 | |
| a | 78033 | |
| o | 43437 | 4.3% |
| b | 21312 | 2.1% |
| Other values (7) | 33637 | 3.3% |
payment_sequential
Real number (ℝ)
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.0447104 |
| Minimum | 1 |
|---|---|
| Maximum | 29 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 1 |
| Maximum | 29 |
| Range | 28 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.38116563 |
|---|---|
| Coefficient of variation (CV) | 0.36485292 |
| Kurtosis | 1008.6736 |
| Mean | 1.0447104 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 23.948637 |
| Sum | 103886 |
| Variance | 0.14528724 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 96479 | |
| 2 | 2382 | 2.4% |
| 3 | 301 | 0.3% |
| 4 | 108 | 0.1% |
| 5 | 52 | 0.1% |
| 6 | 36 | < 0.1% |
| 7 | 28 | < 0.1% |
| 8 | 11 | < 0.1% |
| 9 | 9 | < 0.1% |
| 12 | 8 | < 0.1% |
| Other values (10) | 26 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 96479 | |
| 2 | 2382 | 2.4% |
| 3 | 301 | 0.3% |
| 4 | 108 | 0.1% |
| 5 | 52 | 0.1% |
| 6 | 36 | < 0.1% |
| 7 | 28 | < 0.1% |
| 8 | 11 | < 0.1% |
| 9 | 9 | < 0.1% |
| 10 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 29 | 1 | < 0.1% |
| 26 | 1 | < 0.1% |
| 22 | 1 | < 0.1% |
| 21 | 1 | < 0.1% |
| 19 | 2 | < 0.1% |
| 15 | 2 | < 0.1% |
| 14 | 2 | < 0.1% |
| 13 | 3 | < 0.1% |
| 12 | 8 | |
| 11 | 8 |
payment_installments
Real number (ℝ)
| Distinct | 24 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.9305209 |
| Minimum | 0 |
|---|---|
| Maximum | 24 |
| Zeros | 2 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 10 |
| Maximum | 24 |
| Range | 24 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.7156847 |
|---|---|
| Coefficient of variation (CV) | 0.92669008 |
| Kurtosis | 2.3525407 |
| Mean | 2.9305209 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.5994122 |
| Sum | 291411 |
| Variance | 7.3749431 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 48268 | |
| 2 | 12363 | 12.4% |
| 3 | 10429 | 10.5% |
| 4 | 7070 | 7.1% |
| 10 | 5315 | 5.3% |
| 5 | 5227 | 5.3% |
| 8 | 4251 | 4.3% |
| 6 | 3908 | 3.9% |
| 7 | 1622 | 1.6% |
| 9 | 644 | 0.6% |
| Other values (14) | 343 | 0.3% |
| Value | Count | Frequency (%) |
| 0 | 2 | < 0.1% |
| 1 | 48268 | |
| 2 | 12363 | 12.4% |
| 3 | 10429 | 10.5% |
| 4 | 7070 | 7.1% |
| 5 | 5227 | 5.3% |
| 6 | 3908 | 3.9% |
| 7 | 1622 | 1.6% |
| 8 | 4251 | 4.3% |
| 9 | 644 | 0.6% |
| Value | Count | Frequency (%) |
| 24 | 18 | < 0.1% |
| 23 | 1 | < 0.1% |
| 22 | 1 | < 0.1% |
| 21 | 3 | < 0.1% |
| 20 | 17 | < 0.1% |
| 18 | 27 | < 0.1% |
| 17 | 8 | < 0.1% |
| 16 | 5 | < 0.1% |
| 15 | 74 | |
| 14 | 15 | < 0.1% |
payment_value
Real number (ℝ)
| Distinct | 27979 |
|---|---|
| Distinct (%) | 28.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 160.99027 |
| Minimum | 0 |
|---|---|
| Maximum | 13664.08 |
| Zeros | 3 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 32.38 |
| Q1 | 62.01 |
| median | 105.29 |
| Q3 | 176.97 |
| 95-th percentile | 452.9875 |
| Maximum | 13664.08 |
| Range | 13664.08 |
| Interquartile range (IQR) | 114.96 |
Descriptive statistics
| Standard deviation | 221.95126 |
|---|---|
| Coefficient of variation (CV) | 1.3786626 |
| Kurtosis | 233.40652 |
| Mean | 160.99027 |
| Median Absolute Deviation (MAD) | 51.61 |
| Skewness | 9.1501694 |
| Sum | 16008872 |
| Variance | 49262.36 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 77.57 | 254 | 0.3% |
| 35 | 169 | 0.2% |
| 73.34 | 163 | 0.2% |
| 116.94 | 132 | 0.1% |
| 56.78 | 124 | 0.1% |
| 107.78 | 121 | 0.1% |
| 65 | 117 | 0.1% |
| 86.15 | 107 | 0.1% |
| 99.9 | 106 | 0.1% |
| 67.5 | 105 | 0.1% |
| Other values (27969) | 98042 |
| Value | Count | Frequency (%) |
| 0 | 3 | |
| 9.59 | 1 | < 0.1% |
| 10.07 | 1 | < 0.1% |
| 10.89 | 1 | < 0.1% |
| 11.56 | 1 | < 0.1% |
| 11.62 | 1 | < 0.1% |
| 11.63 | 2 | |
| 12.22 | 1 | < 0.1% |
| 12.28 | 1 | < 0.1% |
| 12.39 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 13664.08 | 1 | |
| 7274.88 | 1 | |
| 6929.31 | 1 | |
| 6922.21 | 1 | |
| 6726.66 | 1 | |
| 6081.54 | 1 | |
| 4950.34 | 1 | |
| 4809.44 | 1 | |
| 4764.34 | 1 | |
| 4681.78 | 1 |
product_most_frequent
Categorical
| Distinct | 31847 |
|---|---|
| Distinct (%) | 32.3% |
| Missing | 775 |
| Missing (%) | 0.8% |
| Memory size | 1.5 MiB |
| aca2eb7d00ea1a7b8ebd4e68314663af | 429 |
|---|---|
| 99a4788cb24856965c36a24e339b6058 | 427 |
| 422879e10f46682990de24d770e7f83d | 339 |
| d1c427060a0f73f6b889a5c7c61f2ac4 | 311 |
| 53b36df67ebb7c41585e8d54d6772e08 | 303 |
| Other values (31842) |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 32 |
| Min length | 32 |
Characters and Unicode
| Total characters | 3157312 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 18956 ? |
|---|---|
| Unique (%) | 19.2% |
Sample
| 1st row | 87285b34884572647811a353c7ac498a |
|---|---|
| 2nd row | 595fac2a385ac33a80bd5114aec74eb8 |
| 3rd row | aa4383b373c6aca5d8797843e5594415 |
| 4th row | d0b61bfb1de832b15ba9d266ca96e5b0 |
| 5th row | 65266b2da20d04dbe00c5c2d3bb7859e |
Common Values
| Value | Count | Frequency (%) |
| aca2eb7d00ea1a7b8ebd4e68314663af | 429 | 0.4% |
| 99a4788cb24856965c36a24e339b6058 | 427 | 0.4% |
| 422879e10f46682990de24d770e7f83d | 339 | 0.3% |
| d1c427060a0f73f6b889a5c7c61f2ac4 | 311 | 0.3% |
| 53b36df67ebb7c41585e8d54d6772e08 | 303 | 0.3% |
| 389d119b48cf3043d311335e499d9c6b | 299 | 0.3% |
| 368c6c730842d78016ad823897a372db | 285 | 0.3% |
| 154e7e31ebfa092203795c972e5804a6 | 269 | 0.3% |
| 53759a2ecddad2bb87a079a1f1519f73 | 264 | 0.3% |
| 2b4609f8948be18874494203496bc318 | 258 | 0.3% |
| Other values (31837) | 95482 | |
| (Missing) | 775 | 0.8% |
Length
| Value | Count | Frequency (%) |
| aca2eb7d00ea1a7b8ebd4e68314663af | 429 | 0.4% |
| 99a4788cb24856965c36a24e339b6058 | 427 | 0.4% |
| 422879e10f46682990de24d770e7f83d | 339 | 0.3% |
| d1c427060a0f73f6b889a5c7c61f2ac4 | 311 | 0.3% |
| 53b36df67ebb7c41585e8d54d6772e08 | 303 | 0.3% |
| 389d119b48cf3043d311335e499d9c6b | 299 | 0.3% |
| 368c6c730842d78016ad823897a372db | 285 | 0.3% |
| 154e7e31ebfa092203795c972e5804a6 | 269 | 0.3% |
| 53759a2ecddad2bb87a079a1f1519f73 | 264 | 0.3% |
| 2b4609f8948be18874494203496bc318 | 258 | 0.3% |
| Other values (31837) | 95482 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 202985 | 6.4% |
| 9 | 200585 | 6.4% |
| 8 | 199130 | 6.3% |
| e | 198981 | 6.3% |
| 7 | 198253 | 6.3% |
| a | 198185 | 6.3% |
| 4 | 198184 | 6.3% |
| 0 | 197884 | 6.3% |
| c | 197489 | 6.3% |
| 5 | 196954 | 6.2% |
| Other values (6) | 1168682 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1982405 | |
| Lowercase Letter | 1174907 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 202985 | |
| 9 | 200585 | |
| 8 | 199130 | |
| 7 | 198253 | |
| 4 | 198184 | |
| 0 | 197884 | |
| 5 | 196954 | |
| 2 | 196887 | |
| 6 | 196022 | |
| 1 | 195521 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 198981 | |
| a | 198185 | |
| c | 197489 | |
| b | 195448 | |
| d | 193791 | |
| f | 191013 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1982405 | |
| Latin | 1174907 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 202985 | |
| 9 | 200585 | |
| 8 | 199130 | |
| 7 | 198253 | |
| 4 | 198184 | |
| 0 | 197884 | |
| 5 | 196954 | |
| 2 | 196887 | |
| 6 | 196022 | |
| 1 | 195521 |
Latin
| Value | Count | Frequency (%) |
| e | 198981 | |
| a | 198185 | |
| c | 197489 | |
| b | 195448 | |
| d | 193791 | |
| f | 191013 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3157312 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 202985 | 6.4% |
| 9 | 200585 | 6.4% |
| 8 | 199130 | 6.3% |
| e | 198981 | 6.3% |
| 7 | 198253 | 6.3% |
| a | 198185 | 6.3% |
| 4 | 198184 | 6.3% |
| 0 | 197884 | 6.3% |
| c | 197489 | 6.3% |
| 5 | 196954 | 6.2% |
| Other values (6) | 1168682 |
nb_items
Real number (ℝ)
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 775 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.1417307 |
| Minimum | 1 |
|---|---|
| Maximum | 21 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 21 |
| Range | 20 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.53845237 |
|---|---|
| Coefficient of variation (CV) | 0.47161067 |
| Kurtosis | 114.85034 |
| Mean | 1.1417307 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.5270056 |
| Sum | 112650 |
| Variance | 0.28993096 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 88863 | |
| 2 | 7516 | 7.6% |
| 3 | 1322 | 1.3% |
| 4 | 505 | 0.5% |
| 5 | 204 | 0.2% |
| 6 | 198 | 0.2% |
| 7 | 22 | < 0.1% |
| 10 | 8 | < 0.1% |
| 8 | 8 | < 0.1% |
| 12 | 5 | < 0.1% |
| Other values (7) | 15 | < 0.1% |
| (Missing) | 775 | 0.8% |
| Value | Count | Frequency (%) |
| 1 | 88863 | |
| 2 | 7516 | 7.6% |
| 3 | 1322 | 1.3% |
| 4 | 505 | 0.5% |
| 5 | 204 | 0.2% |
| 6 | 198 | 0.2% |
| 7 | 22 | < 0.1% |
| 8 | 8 | < 0.1% |
| 9 | 3 | < 0.1% |
| 10 | 8 | < 0.1% |
| Value | Count | Frequency (%) |
| 21 | 1 | < 0.1% |
| 20 | 2 | < 0.1% |
| 15 | 2 | < 0.1% |
| 14 | 2 | < 0.1% |
| 13 | 1 | < 0.1% |
| 12 | 5 | |
| 11 | 4 | |
| 10 | 8 | |
| 9 | 3 | < 0.1% |
| 8 | 8 |
sum_price
Real number (ℝ)
| Distinct | 7761 |
|---|---|
| Distinct (%) | 7.9% |
| Missing | 775 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 137.75408 |
| Minimum | 0.85 |
|---|---|
| Maximum | 13440 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0.85 |
|---|---|
| 5-th percentile | 19 |
| Q1 | 45.9 |
| median | 86.9 |
| Q3 | 149.9 |
| 95-th percentile | 399.9 |
| Maximum | 13440 |
| Range | 13439.15 |
| Interquartile range (IQR) | 104 |
Descriptive statistics
| Standard deviation | 210.64515 |
|---|---|
| Coefficient of variation (CV) | 1.5291391 |
| Kurtosis | 266.06694 |
| Mean | 137.75408 |
| Median Absolute Deviation (MAD) | 47.9 |
| Skewness | 9.7277407 |
| Sum | 13591644 |
| Variance | 44371.377 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 59.9 | 1723 | 1.7% |
| 69.9 | 1605 | 1.6% |
| 49.9 | 1420 | 1.4% |
| 89.9 | 1248 | 1.3% |
| 99.9 | 1191 | 1.2% |
| 79.9 | 1009 | 1.0% |
| 39.9 | 978 | 1.0% |
| 29.9 | 964 | 1.0% |
| 19.9 | 915 | 0.9% |
| 29.99 | 872 | 0.9% |
| Other values (7751) | 86741 |
| Value | Count | Frequency (%) |
| 0.85 | 2 | |
| 2.2 | 1 | < 0.1% |
| 2.29 | 1 | < 0.1% |
| 2.9 | 1 | < 0.1% |
| 2.99 | 1 | < 0.1% |
| 3 | 2 | |
| 3.49 | 1 | < 0.1% |
| 3.5 | 2 | |
| 3.54 | 1 | < 0.1% |
| 3.85 | 3 |
| Value | Count | Frequency (%) |
| 13440 | 1 | |
| 7160 | 1 | |
| 6735 | 1 | |
| 6729 | 1 | |
| 6499 | 1 | |
| 5934.6 | 1 | |
| 4799 | 1 | |
| 4690 | 1 | |
| 4599.9 | 1 | |
| 4590 | 1 |
sum_freight_value
Real number (ℝ)
| Distinct | 7970 |
|---|---|
| Distinct (%) | 8.1% |
| Missing | 775 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22.823562 |
| Minimum | 0 |
|---|---|
| Maximum | 1794.96 |
| Zeros | 338 |
| Zeros (%) | 0.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 7.88 |
| Q1 | 13.85 |
| median | 17.17 |
| Q3 | 24.04 |
| 95-th percentile | 54.96 |
| Maximum | 1794.96 |
| Range | 1794.96 |
| Interquartile range (IQR) | 10.19 |
Descriptive statistics
| Standard deviation | 21.650909 |
|---|---|
| Coefficient of variation (CV) | 0.94862098 |
| Kurtosis | 565.34173 |
| Mean | 22.823562 |
| Median Absolute Deviation (MAD) | 4.38 |
| Skewness | 12.052723 |
| Sum | 2251909.5 |
| Variance | 468.76188 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 15.1 | 2952 | 3.0% |
| 7.78 | 1839 | 1.8% |
| 14.1 | 1529 | 1.5% |
| 11.85 | 1444 | 1.5% |
| 18.23 | 1219 | 1.2% |
| 7.39 | 1137 | 1.1% |
| 15.23 | 823 | 0.8% |
| 16.11 | 795 | 0.8% |
| 8.72 | 766 | 0.8% |
| 16.79 | 697 | 0.7% |
| Other values (7960) | 85465 | |
| (Missing) | 775 | 0.8% |
| Value | Count | Frequency (%) |
| 0 | 338 | |
| 5.7 | 1 | < 0.1% |
| 5.82 | 1 | < 0.1% |
| 5.88 | 2 | < 0.1% |
| 6.52 | 1 | < 0.1% |
| 6.53 | 2 | < 0.1% |
| 6.56 | 1 | < 0.1% |
| 6.57 | 5 | < 0.1% |
| 6.78 | 5 | < 0.1% |
| 7 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1794.96 | 1 | |
| 1002.29 | 1 | |
| 711.33 | 1 | |
| 626.64 | 1 | |
| 502.98 | 1 | |
| 497.42 | 1 | |
| 497.08 | 1 | |
| 479.28 | 1 | |
| 458.73 | 1 | |
| 456.47 | 1 |
customer_unique_id
Categorical
HIGH CARDINALITY  UNIFORM 
| Distinct | 96096 |
|---|---|
| Distinct (%) | 96.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| 8d50f5eadf50201ccdcedfb9e2ac8455 | 17 |
|---|---|
| 3e43e6105506432c953e165fb2acf44c | 9 |
| 1b6c7548a2a1f9037c1fd3ddfed95f33 | 7 |
| ca77025e7201e3b30c44b472ff346268 | 7 |
| 6469f99c1f9dfae7733b25662e7f1782 | 7 |
| Other values (96091) |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 32 |
| Min length | 32 |
Characters and Unicode
| Total characters | 3182112 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 93099 ? |
|---|---|
| Unique (%) | 93.6% |
Sample
| 1st row | 7c396fd4830fd04220f754e42b4e5bff |
|---|---|
| 2nd row | af07308b275d755c9edb36a90c618231 |
| 3rd row | 3a653a41f6f9fc3d2a113cf8398680e8 |
| 4th row | 7c142cf63193a1473d2e66489a9ae977 |
| 5th row | 72632f0f9dd73dfee390c9b22eb56dd6 |
Common Values
| Value | Count | Frequency (%) |
| 8d50f5eadf50201ccdcedfb9e2ac8455 | 17 | < 0.1% |
| 3e43e6105506432c953e165fb2acf44c | 9 | < 0.1% |
| 1b6c7548a2a1f9037c1fd3ddfed95f33 | 7 | < 0.1% |
| ca77025e7201e3b30c44b472ff346268 | 7 | < 0.1% |
| 6469f99c1f9dfae7733b25662e7f1782 | 7 | < 0.1% |
| de34b16117594161a6a89c50b289d35a | 6 | < 0.1% |
| 47c1a3033b8b77b3ab6e109eb4d5fdf3 | 6 | < 0.1% |
| 63cfc61cee11cbe306bff5857d00bfe4 | 6 | < 0.1% |
| 12f5d6e1cbf93dafd9dcc19095df0b3d | 6 | < 0.1% |
| f0e310a6839dce9de1638e0fe5ab282a | 6 | < 0.1% |
| Other values (96086) | 99364 |
Length
| Value | Count | Frequency (%) |
| 8d50f5eadf50201ccdcedfb9e2ac8455 | 17 | < 0.1% |
| 3e43e6105506432c953e165fb2acf44c | 9 | < 0.1% |
| 1b6c7548a2a1f9037c1fd3ddfed95f33 | 7 | < 0.1% |
| 6469f99c1f9dfae7733b25662e7f1782 | 7 | < 0.1% |
| ca77025e7201e3b30c44b472ff346268 | 7 | < 0.1% |
| de34b16117594161a6a89c50b289d35a | 6 | < 0.1% |
| 47c1a3033b8b77b3ab6e109eb4d5fdf3 | 6 | < 0.1% |
| 63cfc61cee11cbe306bff5857d00bfe4 | 6 | < 0.1% |
| 12f5d6e1cbf93dafd9dcc19095df0b3d | 6 | < 0.1% |
| f0e310a6839dce9de1638e0fe5ab282a | 6 | < 0.1% |
| Other values (96086) | 99364 |
Most occurring characters
| Value | Count | Frequency (%) |
| 6 | 199366 | 6.3% |
| 8 | 199355 | 6.3% |
| 1 | 199334 | 6.3% |
| a | 199132 | 6.3% |
| d | 199088 | 6.3% |
| b | 199054 | 6.3% |
| 5 | 199029 | 6.3% |
| 0 | 199023 | 6.3% |
| 2 | 198902 | 6.3% |
| e | 198867 | 6.2% |
| Other values (6) | 1190962 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1989082 | |
| Lowercase Letter | 1193030 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 199366 | |
| 8 | 199355 | |
| 1 | 199334 | |
| 5 | 199029 | |
| 0 | 199023 | |
| 2 | 198902 | |
| 9 | 198798 | |
| 3 | 198645 | |
| 4 | 198396 | |
| 7 | 198234 |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 199132 | |
| d | 199088 | |
| b | 199054 | |
| e | 198867 | |
| f | 198661 | |
| c | 198228 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1989082 | |
| Latin | 1193030 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 6 | 199366 | |
| 8 | 199355 | |
| 1 | 199334 | |
| 5 | 199029 | |
| 0 | 199023 | |
| 2 | 198902 | |
| 9 | 198798 | |
| 3 | 198645 | |
| 4 | 198396 | |
| 7 | 198234 |
Latin
| Value | Count | Frequency (%) |
| a | 199132 | |
| d | 199088 | |
| b | 199054 | |
| e | 198867 | |
| f | 198661 | |
| c | 198228 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3182112 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6 | 199366 | 6.3% |
| 8 | 199355 | 6.3% |
| 1 | 199334 | 6.3% |
| a | 199132 | 6.3% |
| d | 199088 | 6.3% |
| b | 199054 | 6.3% |
| 5 | 199029 | 6.3% |
| 0 | 199023 | 6.3% |
| 2 | 198902 | 6.3% |
| e | 198867 | 6.2% |
| Other values (6) | 1190962 |
customer_zip_code_prefix
Real number (ℝ)
| Distinct | 14994 |
|---|---|
| Distinct (%) | 15.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 35137.475 |
| Minimum | 1003 |
|---|---|
| Maximum | 99990 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 1003 |
|---|---|
| 5-th percentile | 3315 |
| Q1 | 11347 |
| median | 24416 |
| Q3 | 58900 |
| 95-th percentile | 90550 |
| Maximum | 99990 |
| Range | 98987 |
| Interquartile range (IQR) | 47553 |
Descriptive statistics
| Standard deviation | 29797.939 |
|---|---|
| Coefficient of variation (CV) | 0.84803872 |
| Kurtosis | -0.78820393 |
| Mean | 35137.475 |
| Median Absolute Deviation (MAD) | 16386 |
| Skewness | 0.77902506 |
| Sum | 3.4941056 × 109 |
| Variance | 8.8791717 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 22790 | 142 | 0.1% |
| 24220 | 124 | 0.1% |
| 22793 | 121 | 0.1% |
| 24230 | 117 | 0.1% |
| 22775 | 110 | 0.1% |
| 29101 | 101 | 0.1% |
| 13212 | 95 | 0.1% |
| 35162 | 93 | 0.1% |
| 22631 | 89 | 0.1% |
| 38400 | 87 | 0.1% |
| Other values (14984) | 98362 |
| Value | Count | Frequency (%) |
| 1003 | 1 | < 0.1% |
| 1004 | 2 | < 0.1% |
| 1005 | 6 | |
| 1006 | 2 | < 0.1% |
| 1007 | 4 | |
| 1008 | 4 | |
| 1009 | 7 | |
| 1011 | 5 | |
| 1012 | 3 | |
| 1013 | 3 |
| Value | Count | Frequency (%) |
| 99990 | 1 | < 0.1% |
| 99980 | 2 | < 0.1% |
| 99970 | 1 | < 0.1% |
| 99965 | 2 | < 0.1% |
| 99960 | 2 | < 0.1% |
| 99955 | 3 | < 0.1% |
| 99950 | 9 | |
| 99940 | 2 | < 0.1% |
| 99930 | 5 | |
| 99925 | 1 | < 0.1% |
customer_city
Categorical
| Distinct | 4119 |
|---|---|
| Distinct (%) | 4.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| sao paulo | |
|---|---|
| rio de janeiro | 6882 |
| belo horizonte | 2773 |
| brasilia | 2131 |
| curitiba | 1521 |
| Other values (4114) |
Length
| Max length | 32 |
|---|---|
| Median length | 27 |
| Mean length | 10.344466 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1028664 |
|---|---|
| Distinct characters | 31 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1144 ? |
|---|---|
| Unique (%) | 1.2% |
Sample
| 1st row | sao paulo |
|---|---|
| 2nd row | barreiras |
| 3rd row | vianopolis |
| 4th row | sao goncalo do amarante |
| 5th row | santo andre |
Common Values
| Value | Count | Frequency (%) |
| sao paulo | 15540 | 15.6% |
| rio de janeiro | 6882 | 6.9% |
| belo horizonte | 2773 | 2.8% |
| brasilia | 2131 | 2.1% |
| curitiba | 1521 | 1.5% |
| campinas | 1444 | 1.5% |
| porto alegre | 1379 | 1.4% |
| salvador | 1245 | 1.3% |
| guarulhos | 1189 | 1.2% |
| sao bernardo do campo | 938 | 0.9% |
| Other values (4109) | 64399 |
Length
| Value | Count | Frequency (%) |
| sao | 21050 | 12.1% |
| paulo | 15606 | 9.0% |
| de | 9684 | 5.6% |
| rio | 8278 | 4.7% |
| janeiro | 6882 | 3.9% |
| do | 4276 | 2.5% |
| belo | 2833 | 1.6% |
| horizonte | 2798 | 1.6% |
| brasilia | 2140 | 1.2% |
| porto | 1648 | 0.9% |
| Other values (3285) | 99118 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 169618 | |
| o | 126534 | |
| i | 78754 | 7.7% |
| r | 76497 | 7.4% |
| 74872 | 7.3% | |
| e | 67028 | 6.5% |
| s | 62903 | 6.1% |
| n | 45721 | 4.4% |
| u | 44917 | 4.4% |
| l | 44815 | 4.4% |
| Other values (21) | 237005 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 953332 | |
| Space Separator | 74872 | 7.3% |
| Dash Punctuation | 232 | < 0.1% |
| Other Punctuation | 226 | < 0.1% |
| Decimal Number | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 169618 | |
| o | 126534 | |
| i | 78754 | 8.3% |
| r | 76497 | 8.0% |
| e | 67028 | 7.0% |
| s | 62903 | 6.6% |
| n | 45721 | 4.8% |
| u | 44917 | 4.7% |
| l | 44815 | 4.7% |
| p | 37119 | 3.9% |
| Other values (16) | 199426 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 4 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 74872 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 232 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 226 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 953332 | |
| Common | 75332 | 7.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 169618 | |
| o | 126534 | |
| i | 78754 | 8.3% |
| r | 76497 | 8.0% |
| e | 67028 | 7.0% |
| s | 62903 | 6.6% |
| n | 45721 | 4.8% |
| u | 44917 | 4.7% |
| l | 44815 | 4.7% |
| p | 37119 | 3.9% |
| Other values (16) | 199426 |
Common
| Value | Count | Frequency (%) |
| 74872 | ||
| - | 232 | 0.3% |
| ' | 226 | 0.3% |
| 1 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1028664 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 169618 | |
| o | 126534 | |
| i | 78754 | 7.7% |
| r | 76497 | 7.4% |
| 74872 | 7.3% | |
| e | 67028 | 6.5% |
| s | 62903 | 6.1% |
| n | 45721 | 4.4% |
| u | 44917 | 4.4% |
| l | 44815 | 4.4% |
| Other values (21) | 237005 |
customer_state
Categorical
| Distinct | 27 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| SP | |
|---|---|
| RJ | |
| MG | |
| RS | |
| PR | |
| Other values (22) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 198882 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SP |
|---|---|
| 2nd row | BA |
| 3rd row | GO |
| 4th row | RN |
| 5th row | SP |
Common Values
| Value | Count | Frequency (%) |
| SP | 41746 | |
| RJ | 12852 | 12.9% |
| MG | 11635 | 11.7% |
| RS | 5466 | 5.5% |
| PR | 5045 | 5.1% |
| SC | 3637 | 3.7% |
| BA | 3380 | 3.4% |
| DF | 2140 | 2.2% |
| ES | 2033 | 2.0% |
| GO | 2020 | 2.0% |
| Other values (17) | 9487 | 9.5% |
Length
| Value | Count | Frequency (%) |
| sp | 41746 | |
| rj | 12852 | 12.9% |
| mg | 11635 | 11.7% |
| rs | 5466 | 5.5% |
| pr | 5045 | 5.1% |
| sc | 3637 | 3.7% |
| ba | 3380 | 3.4% |
| df | 2140 | 2.2% |
| es | 2033 | 2.0% |
| go | 2020 | 2.0% |
| Other values (17) | 9487 | 9.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 53947 | |
| P | 50517 | |
| R | 24193 | |
| M | 14152 | 7.1% |
| G | 13655 | 6.9% |
| J | 12852 | 6.5% |
| A | 5812 | 2.9% |
| E | 5371 | 2.7% |
| C | 5054 | 2.5% |
| B | 3916 | 2.0% |
| Other values (7) | 9413 | 4.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 198882 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 53947 | |
| P | 50517 | |
| R | 24193 | |
| M | 14152 | 7.1% |
| G | 13655 | 6.9% |
| J | 12852 | 6.5% |
| A | 5812 | 2.9% |
| E | 5371 | 2.7% |
| C | 5054 | 2.5% |
| B | 3916 | 2.0% |
| Other values (7) | 9413 | 4.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 198882 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 53947 | |
| P | 50517 | |
| R | 24193 | |
| M | 14152 | 7.1% |
| G | 13655 | 6.9% |
| J | 12852 | 6.5% |
| A | 5812 | 2.9% |
| E | 5371 | 2.7% |
| C | 5054 | 2.5% |
| B | 3916 | 2.0% |
| Other values (7) | 9413 | 4.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 198882 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 53947 | |
| P | 50517 | |
| R | 24193 | |
| M | 14152 | 7.1% |
| G | 13655 | 6.9% |
| J | 12852 | 6.5% |
| A | 5812 | 2.9% |
| E | 5371 | 2.7% |
| C | 5054 | 2.5% |
| B | 3916 | 2.0% |
| Other values (7) | 9413 | 4.7% |
product_id
Categorical
| Distinct | 31847 |
|---|---|
| Distinct (%) | 32.3% |
| Missing | 775 |
| Missing (%) | 0.8% |
| Memory size | 1.5 MiB |
| aca2eb7d00ea1a7b8ebd4e68314663af | 429 |
|---|---|
| 99a4788cb24856965c36a24e339b6058 | 427 |
| 422879e10f46682990de24d770e7f83d | 339 |
| d1c427060a0f73f6b889a5c7c61f2ac4 | 311 |
| 53b36df67ebb7c41585e8d54d6772e08 | 303 |
| Other values (31842) |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 32 |
| Min length | 32 |
Characters and Unicode
| Total characters | 3157312 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 18956 ? |
|---|---|
| Unique (%) | 19.2% |
Sample
| 1st row | 87285b34884572647811a353c7ac498a |
|---|---|
| 2nd row | 595fac2a385ac33a80bd5114aec74eb8 |
| 3rd row | aa4383b373c6aca5d8797843e5594415 |
| 4th row | d0b61bfb1de832b15ba9d266ca96e5b0 |
| 5th row | 65266b2da20d04dbe00c5c2d3bb7859e |
Common Values
| Value | Count | Frequency (%) |
| aca2eb7d00ea1a7b8ebd4e68314663af | 429 | 0.4% |
| 99a4788cb24856965c36a24e339b6058 | 427 | 0.4% |
| 422879e10f46682990de24d770e7f83d | 339 | 0.3% |
| d1c427060a0f73f6b889a5c7c61f2ac4 | 311 | 0.3% |
| 53b36df67ebb7c41585e8d54d6772e08 | 303 | 0.3% |
| 389d119b48cf3043d311335e499d9c6b | 299 | 0.3% |
| 368c6c730842d78016ad823897a372db | 285 | 0.3% |
| 154e7e31ebfa092203795c972e5804a6 | 269 | 0.3% |
| 53759a2ecddad2bb87a079a1f1519f73 | 264 | 0.3% |
| 2b4609f8948be18874494203496bc318 | 258 | 0.3% |
| Other values (31837) | 95482 | |
| (Missing) | 775 | 0.8% |
Length
| Value | Count | Frequency (%) |
| aca2eb7d00ea1a7b8ebd4e68314663af | 429 | 0.4% |
| 99a4788cb24856965c36a24e339b6058 | 427 | 0.4% |
| 422879e10f46682990de24d770e7f83d | 339 | 0.3% |
| d1c427060a0f73f6b889a5c7c61f2ac4 | 311 | 0.3% |
| 53b36df67ebb7c41585e8d54d6772e08 | 303 | 0.3% |
| 389d119b48cf3043d311335e499d9c6b | 299 | 0.3% |
| 368c6c730842d78016ad823897a372db | 285 | 0.3% |
| 154e7e31ebfa092203795c972e5804a6 | 269 | 0.3% |
| 53759a2ecddad2bb87a079a1f1519f73 | 264 | 0.3% |
| 2b4609f8948be18874494203496bc318 | 258 | 0.3% |
| Other values (31837) | 95482 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 202985 | 6.4% |
| 9 | 200585 | 6.4% |
| 8 | 199130 | 6.3% |
| e | 198981 | 6.3% |
| 7 | 198253 | 6.3% |
| a | 198185 | 6.3% |
| 4 | 198184 | 6.3% |
| 0 | 197884 | 6.3% |
| c | 197489 | 6.3% |
| 5 | 196954 | 6.2% |
| Other values (6) | 1168682 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1982405 | |
| Lowercase Letter | 1174907 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 202985 | |
| 9 | 200585 | |
| 8 | 199130 | |
| 7 | 198253 | |
| 4 | 198184 | |
| 0 | 197884 | |
| 5 | 196954 | |
| 2 | 196887 | |
| 6 | 196022 | |
| 1 | 195521 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 198981 | |
| a | 198185 | |
| c | 197489 | |
| b | 195448 | |
| d | 193791 | |
| f | 191013 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1982405 | |
| Latin | 1174907 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 202985 | |
| 9 | 200585 | |
| 8 | 199130 | |
| 7 | 198253 | |
| 4 | 198184 | |
| 0 | 197884 | |
| 5 | 196954 | |
| 2 | 196887 | |
| 6 | 196022 | |
| 1 | 195521 |
Latin
| Value | Count | Frequency (%) |
| e | 198981 | |
| a | 198185 | |
| c | 197489 | |
| b | 195448 | |
| d | 193791 | |
| f | 191013 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3157312 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 202985 | 6.4% |
| 9 | 200585 | 6.4% |
| 8 | 199130 | 6.3% |
| e | 198981 | 6.3% |
| 7 | 198253 | 6.3% |
| a | 198185 | 6.3% |
| 4 | 198184 | 6.3% |
| 0 | 197884 | 6.3% |
| c | 197489 | 6.3% |
| 5 | 196954 | 6.2% |
| Other values (6) | 1168682 |
product_description_lenght
Real number (ℝ)
| Distinct | 2955 |
|---|---|
| Distinct (%) | 3.0% |
| Missing | 775 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 782.638 |
| Minimum | 0 |
|---|---|
| Maximum | 3992 |
| Zeros | 1420 |
| Zeros (%) | 1.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 138 |
| Q1 | 341 |
| median | 600 |
| Q3 | 988 |
| 95-th percentile | 2120 |
| Maximum | 3992 |
| Range | 3992 |
| Interquartile range (IQR) | 647 |
Descriptive statistics
| Standard deviation | 656.75087 |
|---|---|
| Coefficient of variation (CV) | 0.83915025 |
| Kurtosis | 4.7949182 |
| Mean | 782.638 |
| Median Absolute Deviation (MAD) | 300 |
| Skewness | 1.9719271 |
| Sum | 77219761 |
| Variance | 431321.71 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1420 | 1.4% |
| 1893 | 583 | 0.6% |
| 492 | 539 | 0.5% |
| 341 | 536 | 0.5% |
| 903 | 479 | 0.5% |
| 245 | 478 | 0.5% |
| 348 | 459 | 0.5% |
| 236 | 428 | 0.4% |
| 366 | 394 | 0.4% |
| 575 | 361 | 0.4% |
| Other values (2945) | 92989 | |
| (Missing) | 775 | 0.8% |
| Value | Count | Frequency (%) |
| 0 | 1420 | |
| 4 | 6 | < 0.1% |
| 8 | 1 | < 0.1% |
| 15 | 1 | < 0.1% |
| 20 | 6 | < 0.1% |
| 26 | 2 | < 0.1% |
| 27 | 3 | < 0.1% |
| 28 | 2 | < 0.1% |
| 30 | 7 | < 0.1% |
| 31 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 3992 | 2 | |
| 3988 | 1 | < 0.1% |
| 3985 | 3 | |
| 3976 | 3 | |
| 3963 | 1 | < 0.1% |
| 3956 | 2 | |
| 3954 | 2 | |
| 3950 | 1 | < 0.1% |
| 3949 | 1 | < 0.1% |
| 3948 | 1 | < 0.1% |
product_photos_qty
Real number (ℝ)
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 775 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.217765 |
| Minimum | 0 |
|---|---|
| Maximum | 20 |
| Zeros | 1420 |
| Zeros (%) | 1.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 3 |
| 95-th percentile | 6 |
| Maximum | 20 |
| Range | 20 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.7535453 |
|---|---|
| Coefficient of variation (CV) | 0.79068131 |
| Kurtosis | 4.4497692 |
| Mean | 2.217765 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.8258834 |
| Sum | 218818 |
| Variance | 3.0749212 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 48050 | |
| 2 | 19157 | 19.3% |
| 3 | 11212 | 11.3% |
| 4 | 7587 | 7.6% |
| 5 | 4980 | 5.0% |
| 6 | 3397 | 3.4% |
| 0 | 1420 | 1.4% |
| 7 | 1405 | 1.4% |
| 8 | 683 | 0.7% |
| 10 | 321 | 0.3% |
| Other values (10) | 454 | 0.5% |
| (Missing) | 775 | 0.8% |
| Value | Count | Frequency (%) |
| 0 | 1420 | 1.4% |
| 1 | 48050 | |
| 2 | 19157 | 19.3% |
| 3 | 11212 | 11.3% |
| 4 | 7587 | 7.6% |
| 5 | 4980 | 5.0% |
| 6 | 3397 | 3.4% |
| 7 | 1405 | 1.4% |
| 8 | 683 | 0.7% |
| 9 | 289 | 0.3% |
| Value | Count | Frequency (%) |
| 20 | 1 | < 0.1% |
| 19 | 2 | < 0.1% |
| 18 | 4 | < 0.1% |
| 17 | 8 | < 0.1% |
| 15 | 12 | < 0.1% |
| 14 | 6 | < 0.1% |
| 13 | 26 | < 0.1% |
| 12 | 44 | < 0.1% |
| 11 | 62 | 0.1% |
| 10 | 321 |
product_weight_g
Real number (ℝ)
| Distinct | 2190 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 791 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2101.1762 |
| Minimum | 0 |
|---|---|
| Maximum | 40425 |
| Zeros | 6 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 125 |
| Q1 | 300 |
| median | 700 |
| Q3 | 1800 |
| 95-th percentile | 9750 |
| Maximum | 40425 |
| Range | 40425 |
| Interquartile range (IQR) | 1500 |
Descriptive statistics
| Standard deviation | 3763.2039 |
|---|---|
| Coefficient of variation (CV) | 1.7909987 |
| Kurtosis | 16.417829 |
| Mean | 2101.1762 |
| Median Absolute Deviation (MAD) | 500 |
| Skewness | 3.6112048 |
| Sum | 2.0728104 × 108 |
| Variance | 14161704 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 200 | 5927 | 6.0% |
| 150 | 4636 | 4.7% |
| 250 | 3998 | 4.0% |
| 300 | 3735 | 3.8% |
| 400 | 3186 | 3.2% |
| 100 | 3112 | 3.1% |
| 350 | 2821 | 2.8% |
| 500 | 2362 | 2.4% |
| 600 | 2284 | 2.3% |
| 700 | 1751 | 1.8% |
| Other values (2180) | 64838 |
| Value | Count | Frequency (%) |
| 0 | 6 | < 0.1% |
| 2 | 5 | < 0.1% |
| 25 | 3 | < 0.1% |
| 50 | 841 | |
| 53 | 2 | < 0.1% |
| 54 | 1 | < 0.1% |
| 55 | 2 | < 0.1% |
| 58 | 1 | < 0.1% |
| 60 | 8 | < 0.1% |
| 61 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 40425 | 3 | < 0.1% |
| 30000 | 255 | |
| 29800 | 1 | < 0.1% |
| 29750 | 1 | < 0.1% |
| 29700 | 3 | < 0.1% |
| 29600 | 5 | < 0.1% |
| 29500 | 1 | < 0.1% |
| 29250 | 1 | < 0.1% |
| 29150 | 1 | < 0.1% |
| 29100 | 1 | < 0.1% |
product_length_cm
Real number (ℝ)
| Distinct | 99 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 791 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 30.09779 |
| Minimum | 7 |
|---|---|
| Maximum | 105 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 7 |
|---|---|
| 5-th percentile | 16 |
| Q1 | 18 |
| median | 25 |
| Q3 | 38 |
| 95-th percentile | 62 |
| Maximum | 105 |
| Range | 98 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 16.125854 |
|---|---|
| Coefficient of variation (CV) | 0.53578201 |
| Kurtosis | 3.7918662 |
| Mean | 30.09779 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 1.7714081 |
| Sum | 2969147 |
| Variance | 260.04318 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 16 | 15278 | 15.4% |
| 20 | 9154 | 9.2% |
| 30 | 6324 | 6.4% |
| 17 | 5339 | 5.4% |
| 18 | 5164 | 5.2% |
| 19 | 4141 | 4.2% |
| 25 | 4130 | 4.2% |
| 40 | 3568 | 3.6% |
| 22 | 3435 | 3.5% |
| 35 | 2590 | 2.6% |
| Other values (89) | 39527 |
| Value | Count | Frequency (%) |
| 7 | 30 | < 0.1% |
| 8 | 2 | < 0.1% |
| 9 | 4 | < 0.1% |
| 10 | 7 | < 0.1% |
| 11 | 82 | 0.1% |
| 12 | 34 | < 0.1% |
| 13 | 50 | 0.1% |
| 14 | 119 | 0.1% |
| 15 | 178 | 0.2% |
| 16 | 15278 |
| Value | Count | Frequency (%) |
| 105 | 301 | |
| 104 | 29 | < 0.1% |
| 103 | 35 | < 0.1% |
| 102 | 42 | < 0.1% |
| 101 | 88 | 0.1% |
| 100 | 310 | |
| 99 | 33 | < 0.1% |
| 98 | 42 | < 0.1% |
| 97 | 10 | < 0.1% |
| 96 | 8 | < 0.1% |
product_height_cm
Real number (ℝ)
| Distinct | 102 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 791 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16.479078 |
| Minimum | 2 |
|---|---|
| Maximum | 105 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 8 |
| median | 13 |
| Q3 | 20 |
| 95-th percentile | 44 |
| Maximum | 105 |
| Range | 103 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 13.310002 |
|---|---|
| Coefficient of variation (CV) | 0.80769096 |
| Kurtosis | 7.4745489 |
| Mean | 16.479078 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 2.2576774 |
| Sum | 1625661 |
| Variance | 177.15615 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10 | 8514 | 8.6% |
| 20 | 5857 | 5.9% |
| 15 | 5665 | 5.7% |
| 12 | 5640 | 5.7% |
| 11 | 5482 | 5.5% |
| 2 | 4438 | 4.5% |
| 4 | 4223 | 4.2% |
| 8 | 4063 | 4.1% |
| 16 | 3991 | 4.0% |
| 5 | 3920 | 3.9% |
| Other values (92) | 46857 |
| Value | Count | Frequency (%) |
| 2 | 4438 | |
| 3 | 2340 | 2.4% |
| 4 | 4223 | |
| 5 | 3920 | |
| 6 | 3027 | 3.0% |
| 7 | 3714 | |
| 8 | 4063 | |
| 9 | 2804 | 2.8% |
| 10 | 8514 | |
| 11 | 5482 |
| Value | Count | Frequency (%) |
| 105 | 109 | |
| 104 | 12 | < 0.1% |
| 103 | 37 | < 0.1% |
| 102 | 7 | < 0.1% |
| 100 | 39 | < 0.1% |
| 99 | 5 | < 0.1% |
| 98 | 3 | < 0.1% |
| 97 | 2 | < 0.1% |
| 96 | 8 | < 0.1% |
| 95 | 21 | < 0.1% |
product_width_cm
Real number (ℝ)
| Distinct | 95 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 791 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23.02002 |
| Minimum | 6 |
|---|---|
| Maximum | 118 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 6 |
|---|---|
| 5-th percentile | 11 |
| Q1 | 15 |
| median | 20 |
| Q3 | 30 |
| 95-th percentile | 45 |
| Maximum | 118 |
| Range | 112 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 11.746284 |
|---|---|
| Coefficient of variation (CV) | 0.51026386 |
| Kurtosis | 4.626345 |
| Mean | 23.02002 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 1.7227086 |
| Sum | 2270925 |
| Variance | 137.9752 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 20 | 10480 | 10.5% |
| 11 | 9185 | 9.2% |
| 15 | 7911 | 8.0% |
| 16 | 7387 | 7.4% |
| 30 | 6427 | 6.5% |
| 12 | 4846 | 4.9% |
| 13 | 4683 | 4.7% |
| 14 | 4079 | 4.1% |
| 18 | 3566 | 3.6% |
| 40 | 3383 | 3.4% |
| Other values (85) | 36703 |
| Value | Count | Frequency (%) |
| 6 | 2 | < 0.1% |
| 7 | 5 | < 0.1% |
| 8 | 16 | < 0.1% |
| 9 | 48 | < 0.1% |
| 10 | 68 | 0.1% |
| 11 | 9185 | |
| 12 | 4846 | |
| 13 | 4683 | |
| 14 | 4079 | |
| 15 | 7911 |
| Value | Count | Frequency (%) |
| 118 | 7 | < 0.1% |
| 105 | 14 | < 0.1% |
| 104 | 1 | < 0.1% |
| 103 | 1 | < 0.1% |
| 102 | 2 | < 0.1% |
| 101 | 2 | < 0.1% |
| 100 | 41 | |
| 98 | 1 | < 0.1% |
| 97 | 1 | < 0.1% |
| 95 | 2 | < 0.1% |
product_category_name_english
Categorical
| Distinct | 72 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 775 |
| Missing (%) | 0.8% |
| Memory size | 1.5 MiB |
| bed_bath_table | |
|---|---|
| health_beauty | |
| sports_leisure | |
| computers_accessories | |
| furniture_decor | |
| Other values (67) |
Length
| Max length | 39 |
|---|---|
| Median length | 31 |
| Mean length | 12.770934 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1260057 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | housewares |
|---|---|
| 2nd row | perfumery |
| 3rd row | auto |
| 4th row | pet_shop |
| 5th row | stationery |
Common Values
| Value | Count | Frequency (%) |
| bed_bath_table | 9301 | 9.4% |
| health_beauty | 8803 | 8.9% |
| sports_leisure | 7681 | 7.7% |
| computers_accessories | 6659 | 6.7% |
| furniture_decor | 6358 | 6.4% |
| housewares | 5820 | 5.9% |
| watches_gifts | 5607 | 5.6% |
| telephony | 4189 | 4.2% |
| auto | 3878 | 3.9% |
| toys | 3851 | 3.9% |
| Other values (62) | 36519 |
Length
| Value | Count | Frequency (%) |
| bed_bath_table | 9301 | 9.4% |
| health_beauty | 8803 | 8.9% |
| sports_leisure | 7681 | 7.8% |
| computers_accessories | 6659 | 6.7% |
| furniture_decor | 6358 | 6.4% |
| housewares | 5820 | 5.9% |
| watches_gifts | 5607 | 5.7% |
| telephony | 4189 | 4.2% |
| auto | 3878 | 3.9% |
| toys | 3851 | 3.9% |
| Other values (62) | 36519 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 152705 | |
| s | 119636 | 9.5% |
| t | 110914 | 8.8% |
| o | 93648 | 7.4% |
| a | 85653 | 6.8% |
| r | 84434 | 6.7% |
| _ | 83451 | 6.6% |
| u | 64761 | 5.1% |
| c | 59784 | 4.7% |
| i | 51976 | 4.1% |
| Other values (15) | 353095 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1176350 | |
| Connector Punctuation | 83451 | 6.6% |
| Decimal Number | 256 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 152705 | |
| s | 119636 | 10.2% |
| t | 110914 | 9.4% |
| o | 93648 | 8.0% |
| a | 85653 | 7.3% |
| r | 84434 | 7.2% |
| u | 64761 | 5.5% |
| c | 59784 | 5.1% |
| i | 51976 | 4.4% |
| h | 50327 | 4.3% |
| Other values (13) | 302512 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 83451 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 256 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1176350 | |
| Common | 83707 | 6.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 152705 | |
| s | 119636 | 10.2% |
| t | 110914 | 9.4% |
| o | 93648 | 8.0% |
| a | 85653 | 7.3% |
| r | 84434 | 7.2% |
| u | 64761 | 5.5% |
| c | 59784 | 5.1% |
| i | 51976 | 4.4% |
| h | 50327 | 4.3% |
| Other values (13) | 302512 |
Common
| Value | Count | Frequency (%) |
| _ | 83451 | |
| 2 | 256 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1260057 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 152705 | |
| s | 119636 | 9.5% |
| t | 110914 | 8.8% |
| o | 93648 | 7.4% |
| a | 85653 | 6.8% |
| r | 84434 | 6.7% |
| _ | 83451 | 6.6% |
| u | 64761 | 5.1% |
| c | 59784 | 4.7% |
| i | 51976 | 4.1% |
| Other values (15) | 353095 |
| length_comment_title | length_comment_message | payment_sequential | payment_installments | payment_value | nb_items | sum_price | sum_freight_value | customer_zip_code_prefix | product_description_lenght | product_photos_qty | product_weight_g | product_length_cm | product_height_cm | product_width_cm | order_status | review_score | payment_type | customer_state | product_category_name_english | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| length_comment_title | 1.000 | 0.313 | -0.006 | 0.004 | 0.033 | 0.026 | 0.028 | 0.053 | -0.016 | 0.030 | 0.006 | -0.011 | -0.032 | -0.003 | -0.020 | 0.021 | 0.080 | 0.028 | 0.013 | 0.034 |
| length_comment_message | 0.313 | 1.000 | 0.005 | 0.045 | 0.068 | 0.082 | 0.062 | 0.070 | 0.015 | -0.007 | -0.005 | 0.037 | 0.014 | 0.022 | 0.015 | 0.048 | 0.206 | 0.008 | 0.021 | 0.020 |
| payment_sequential | -0.006 | 0.005 | 1.000 | -0.063 | -0.008 | -0.004 | -0.010 | 0.007 | 0.006 | -0.010 | 0.001 | 0.010 | 0.013 | 0.001 | 0.014 | 0.010 | 0.000 | 0.108 | 0.005 | 0.000 |
| payment_installments | 0.004 | 0.045 | -0.063 | 1.000 | 0.382 | 0.057 | 0.375 | 0.231 | 0.070 | 0.037 | 0.004 | 0.220 | 0.119 | 0.122 | 0.137 | 0.005 | 0.020 | 0.182 | 0.033 | 0.090 |
| payment_value | 0.033 | 0.068 | -0.008 | 0.382 | 1.000 | 0.221 | 0.990 | 0.566 | 0.112 | 0.192 | 0.008 | 0.519 | 0.268 | 0.348 | 0.275 | 0.012 | 0.014 | 0.006 | 0.015 | 0.100 |
| nb_items | 0.026 | 0.082 | -0.004 | 0.057 | 0.221 | 1.000 | 0.177 | 0.377 | -0.008 | -0.037 | -0.056 | -0.004 | 0.008 | 0.004 | 0.001 | 0.000 | 0.031 | 0.009 | 0.000 | 0.027 |
| sum_price | 0.028 | 0.062 | -0.010 | 0.375 | 0.990 | 0.177 | 1.000 | 0.469 | 0.065 | 0.196 | 0.012 | 0.506 | 0.256 | 0.340 | 0.264 | 0.009 | 0.012 | 0.008 | 0.013 | 0.092 |
| sum_freight_value | 0.053 | 0.070 | 0.007 | 0.231 | 0.566 | 0.377 | 0.469 | 1.000 | 0.427 | 0.100 | -0.009 | 0.419 | 0.273 | 0.272 | 0.262 | 0.000 | 0.015 | 0.000 | 0.030 | 0.054 |
| customer_zip_code_prefix | -0.016 | 0.015 | 0.006 | 0.070 | 0.112 | -0.008 | 0.065 | 0.427 | 1.000 | 0.027 | 0.025 | 0.025 | 0.013 | 0.014 | -0.002 | 0.021 | 0.042 | 0.024 | 0.896 | 0.047 |
| product_description_lenght | 0.030 | -0.007 | -0.010 | 0.037 | 0.192 | -0.037 | 0.196 | 0.100 | 0.027 | 1.000 | 0.155 | 0.100 | -0.011 | 0.132 | -0.060 | 0.003 | 0.011 | 0.012 | 0.019 | 0.212 |
| product_photos_qty | 0.006 | -0.005 | 0.001 | 0.004 | 0.008 | -0.056 | 0.012 | -0.009 | 0.025 | 0.155 | 1.000 | 0.014 | 0.009 | -0.068 | -0.004 | 0.013 | 0.011 | 0.000 | 0.013 | 0.150 |
| product_weight_g | -0.011 | 0.037 | 0.010 | 0.220 | 0.519 | -0.004 | 0.506 | 0.419 | 0.025 | 0.100 | 0.014 | 1.000 | 0.620 | 0.536 | 0.622 | 0.004 | 0.019 | 0.011 | 0.013 | 0.192 |
| product_length_cm | -0.032 | 0.014 | 0.013 | 0.119 | 0.268 | 0.008 | 0.256 | 0.273 | 0.013 | -0.011 | 0.009 | 0.620 | 1.000 | 0.260 | 0.639 | 0.008 | 0.014 | 0.012 | 0.011 | 0.260 |
| product_height_cm | -0.003 | 0.022 | 0.001 | 0.122 | 0.348 | 0.004 | 0.340 | 0.272 | 0.014 | 0.132 | -0.068 | 0.536 | 0.260 | 1.000 | 0.346 | 0.011 | 0.014 | 0.011 | 0.013 | 0.266 |
| product_width_cm | -0.020 | 0.015 | 0.014 | 0.137 | 0.275 | 0.001 | 0.264 | 0.262 | -0.002 | -0.060 | -0.004 | 0.622 | 0.639 | 0.346 | 1.000 | 0.000 | 0.011 | 0.010 | 0.012 | 0.290 |
| order_status | 0.021 | 0.048 | 0.010 | 0.005 | 0.012 | 0.000 | 0.009 | 0.000 | 0.021 | 0.003 | 0.013 | 0.004 | 0.008 | 0.011 | 0.000 | 1.000 | 0.165 | 0.039 | 0.023 | 0.023 |
| review_score | 0.080 | 0.206 | 0.000 | 0.020 | 0.014 | 0.031 | 0.012 | 0.015 | 0.042 | 0.011 | 0.011 | 0.019 | 0.014 | 0.014 | 0.011 | 0.165 | 1.000 | 0.010 | 0.048 | 0.046 |
| payment_type | 0.028 | 0.008 | 0.108 | 0.182 | 0.006 | 0.009 | 0.008 | 0.000 | 0.024 | 0.012 | 0.000 | 0.011 | 0.012 | 0.011 | 0.010 | 0.039 | 0.010 | 1.000 | 0.026 | 0.036 |
| customer_state | 0.013 | 0.021 | 0.005 | 0.033 | 0.015 | 0.000 | 0.013 | 0.030 | 0.896 | 0.019 | 0.013 | 0.013 | 0.011 | 0.013 | 0.012 | 0.023 | 0.048 | 0.026 | 1.000 | 0.030 |
| product_category_name_english | 0.034 | 0.020 | 0.000 | 0.090 | 0.100 | 0.027 | 0.092 | 0.054 | 0.047 | 0.212 | 0.150 | 0.192 | 0.260 | 0.266 | 0.290 | 0.023 | 0.046 | 0.036 | 0.030 | 1.000 |
| order_id | customer_id | order_status | order_purchase_timestamp | order_approved_at | order_delivered_carrier_date | order_delivered_customer_date | order_estimated_delivery_date | review_score | length_comment_title | length_comment_message | review_answer_timestamp | payment_type | payment_sequential | payment_installments | payment_value | product_most_frequent | nb_items | sum_price | sum_freight_value | customer_unique_id | customer_zip_code_prefix | customer_city | customer_state | product_id | product_description_lenght | product_photos_qty | product_weight_g | product_length_cm | product_height_cm | product_width_cm | product_category_name_english | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | e481f51cbdc54678b7cc49136f2d6af7 | 9ef432eb6251297304e76186b10a928d | delivered | 2017-10-02 10:56:33 | 2017-10-02 11:07:15 | 2017-10-04 19:55:00 | 2017-10-10 21:25:13 | 2017-10-18 00:00:00 | 4.0 | 0.0 | 170.0 | 2017-10-12 03:43:48 | credit_card,voucher | 3.0 | 1.0 | 38.71 | 87285b34884572647811a353c7ac498a | 1.0 | 29.99 | 8.72 | 7c396fd4830fd04220f754e42b4e5bff | 3149 | sao paulo | SP | 87285b34884572647811a353c7ac498a | 268.0 | 4.0 | 500.0 | 19.0 | 8.0 | 13.0 | housewares |
| 1 | 53cdb2fc8bc7dce0b6741e2150273451 | b0830fb4747a6c6d20dea0b8c802d7ef | delivered | 2018-07-24 20:41:37 | 2018-07-26 03:24:27 | 2018-07-26 14:31:00 | 2018-08-07 15:27:45 | 2018-08-13 00:00:00 | 4.0 | 16.0 | 20.0 | 2018-08-08 18:37:50 | boleto | 1.0 | 1.0 | 141.46 | 595fac2a385ac33a80bd5114aec74eb8 | 1.0 | 118.70 | 22.76 | af07308b275d755c9edb36a90c618231 | 47813 | barreiras | BA | 595fac2a385ac33a80bd5114aec74eb8 | 178.0 | 1.0 | 400.0 | 19.0 | 13.0 | 19.0 | perfumery |
| 2 | 47770eb9100c2d0c44946d9cf07ec65d | 41ce2a54c0b03bf3443c3d931a367089 | delivered | 2018-08-08 08:38:49 | 2018-08-08 08:55:23 | 2018-08-08 13:50:00 | 2018-08-17 18:06:29 | 2018-09-04 00:00:00 | 5.0 | 0.0 | 0.0 | 2018-08-22 19:07:58 | credit_card | 1.0 | 3.0 | 179.12 | aa4383b373c6aca5d8797843e5594415 | 1.0 | 159.90 | 19.22 | 3a653a41f6f9fc3d2a113cf8398680e8 | 75265 | vianopolis | GO | aa4383b373c6aca5d8797843e5594415 | 232.0 | 1.0 | 420.0 | 24.0 | 19.0 | 21.0 | auto |
| 3 | 949d5b44dbf5de918fe9c16f97b45f8a | f88197465ea7920adcdbec7375364d82 | delivered | 2017-11-18 19:28:06 | 2017-11-18 19:45:59 | 2017-11-22 13:39:59 | 2017-12-02 00:28:42 | 2017-12-15 00:00:00 | 5.0 | 0.0 | 105.0 | 2017-12-05 19:21:58 | credit_card | 1.0 | 1.0 | 72.20 | d0b61bfb1de832b15ba9d266ca96e5b0 | 1.0 | 45.00 | 27.20 | 7c142cf63193a1473d2e66489a9ae977 | 59296 | sao goncalo do amarante | RN | d0b61bfb1de832b15ba9d266ca96e5b0 | 468.0 | 3.0 | 450.0 | 30.0 | 10.0 | 20.0 | pet_shop |
| 4 | ad21c59c0840e6cb83a9ceb5573f8159 | 8ab97904e6daea8866dbdbc4fb7aad2c | delivered | 2018-02-13 21:18:39 | 2018-02-13 22:20:29 | 2018-02-14 19:46:34 | 2018-02-16 18:17:02 | 2018-02-26 00:00:00 | 5.0 | 0.0 | 0.0 | 2018-02-18 13:02:51 | credit_card | 1.0 | 1.0 | 28.62 | 65266b2da20d04dbe00c5c2d3bb7859e | 1.0 | 19.90 | 8.72 | 72632f0f9dd73dfee390c9b22eb56dd6 | 9195 | santo andre | SP | 65266b2da20d04dbe00c5c2d3bb7859e | 316.0 | 4.0 | 250.0 | 51.0 | 15.0 | 15.0 | stationery |
| 5 | a4591c265e18cb1dcee52889e2d8acc3 | 503740e9ca751ccdda7ba28e9ab8f608 | delivered | 2017-07-09 21:57:05 | 2017-07-09 22:10:13 | 2017-07-11 14:58:04 | 2017-07-26 10:57:55 | 2017-08-01 00:00:00 | 4.0 | 0.0 | 0.0 | 2017-07-27 22:48:30 | credit_card | 1.0 | 6.0 | 175.26 | 060cb19345d90064d1015407193c233d | 1.0 | 147.90 | 27.36 | 80bb27c7c16e8f973207a5086ab329e2 | 86320 | congonhinhas | PR | 060cb19345d90064d1015407193c233d | 608.0 | 1.0 | 7150.0 | 65.0 | 10.0 | 65.0 | auto |
| 6 | 136cce7faa42fdb2cefd53fdc79a6098 | ed0271e0b7da060a393796590e7b737a | invoiced | 2017-04-11 12:22:08 | 2017-04-13 13:25:17 | NaN | NaN | 2017-05-09 00:00:00 | 2.0 | 0.0 | 36.0 | 2017-05-13 20:25:42 | credit_card | 1.0 | 1.0 | 65.95 | a1804276d9941ac0733cfd409f5206eb | 1.0 | 49.90 | 16.05 | 36edbb3fb164b1f16485364b6fb04c73 | 98900 | santa rosa | RS | a1804276d9941ac0733cfd409f5206eb | 0.0 | 0.0 | 600.0 | 35.0 | 35.0 | 15.0 | unknown |
| 7 | 6514b8ad8028c9f2cc2374ded245783f | 9bdf08b4b3b52b5526ff42d37d47f222 | delivered | 2017-05-16 13:10:30 | 2017-05-16 13:22:11 | 2017-05-22 10:07:46 | 2017-05-26 12:55:51 | 2017-06-07 00:00:00 | 5.0 | 0.0 | 0.0 | 2017-05-28 02:59:57 | credit_card | 1.0 | 3.0 | 75.16 | 4520766ec412348b8d4caa5e8a18c464 | 1.0 | 59.99 | 15.17 | 932afa1e708222e5821dac9cd5db4cae | 26525 | nilopolis | RJ | 4520766ec412348b8d4caa5e8a18c464 | 956.0 | 1.0 | 50.0 | 16.0 | 16.0 | 17.0 | auto |
| 8 | 76c6e866289321a7c93b82b54852dc33 | f54a9f0e6b351c431402b8461ea51999 | delivered | 2017-01-23 18:29:09 | 2017-01-25 02:50:47 | 2017-01-26 14:16:31 | 2017-02-02 14:08:10 | 2017-03-06 00:00:00 | 1.0 | 0.0 | 0.0 | 2017-02-05 01:58:35 | boleto | 1.0 | 1.0 | 35.95 | ac1789e492dcd698c5c10b97a671243a | 1.0 | 19.90 | 16.05 | 39382392765b6dc74812866ee5ee92a7 | 99655 | faxinalzinho | RS | ac1789e492dcd698c5c10b97a671243a | 432.0 | 2.0 | 300.0 | 35.0 | 35.0 | 15.0 | furniture_decor |
| 9 | e69bfb5eb88e0ed6a785585b27e16dbf | 31ad1d1b63eb9962463f764d4e6e0c9d | delivered | 2017-07-29 11:55:02 | 2017-07-29 12:05:32 | 2017-08-10 19:45:24 | 2017-08-16 17:14:30 | 2017-08-23 00:00:00 | 5.0 | 0.0 | 0.0 | 2017-08-18 01:47:32 | credit_card,voucher | 2.0 | 1.0 | 169.76 | 9a78fb9862b10749a117f7fc3c31f051 | 1.0 | 149.99 | 19.77 | 299905e3934e9e181bfb2e164dd4b4f8 | 18075 | sorocaba | SP | 9a78fb9862b10749a117f7fc3c31f051 | 527.0 | 1.0 | 9750.0 | 42.0 | 41.0 | 42.0 | office_furniture |
| order_id | customer_id | order_status | order_purchase_timestamp | order_approved_at | order_delivered_carrier_date | order_delivered_customer_date | order_estimated_delivery_date | review_score | length_comment_title | length_comment_message | review_answer_timestamp | payment_type | payment_sequential | payment_installments | payment_value | product_most_frequent | nb_items | sum_price | sum_freight_value | customer_unique_id | customer_zip_code_prefix | customer_city | customer_state | product_id | product_description_lenght | product_photos_qty | product_weight_g | product_length_cm | product_height_cm | product_width_cm | product_category_name_english | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 99431 | b0f4af5c1b06e24fef510703bfe9f0a6 | 8e1ec396e317ff4c82a03ce16a0c3eb3 | delivered | 2017-10-27 15:21:00 | 2017-10-27 15:32:49 | 2017-10-30 15:44:34 | 2017-11-10 17:57:22 | 2017-11-22 00:00:00 | 5.0 | 0.0 | 77.0 | 2017-11-15 09:54:14 | credit_card | 1.0 | 3.0 | 164.30 | 595fac2a385ac33a80bd5114aec74eb8 | 1.0 | 142.50 | 21.80 | 1a3b8f1d0782ebedbcf220a96cbc1655 | 57042 | maceio | AL | 595fac2a385ac33a80bd5114aec74eb8 | 178.0 | 1.0 | 400.0 | 19.0 | 13.0 | 19.0 | perfumery |
| 99432 | cfa78b997e329a5295b4ee6972c02979 | a2f7428f0cafbc8e59f20e1444b67315 | delivered | 2017-12-20 09:52:41 | 2017-12-20 10:09:52 | 2017-12-20 20:25:25 | 2018-01-26 15:45:14 | 2018-01-18 00:00:00 | 1.0 | 0.0 | 86.0 | 2018-01-21 02:51:39 | credit_card | 1.0 | 1.0 | 71.04 | 3d2c44374ee42b3003a470f3e937a2ea | 1.0 | 55.90 | 15.14 | a49e8e11e850592fe685ae3c64b40eca | 83870 | campo do tenente | PR | 3d2c44374ee42b3003a470f3e937a2ea | 372.0 | 2.0 | 300.0 | 16.0 | 6.0 | 12.0 | musical_instruments |
| 99433 | 9115830be804184b91f5c00f6f49f92d | da2124f134f5dfbce9d06f29bdb6c308 | delivered | 2017-10-04 19:57:37 | 2017-10-04 20:07:14 | 2017-10-05 16:52:52 | 2017-10-20 20:25:45 | 2017-11-07 00:00:00 | 5.0 | 0.0 | 0.0 | 2017-10-23 14:48:40 | credit_card,voucher | 2.0 | 2.0 | 106.79 | 49d2e2460386273b195e7e59b43587c3 | 2.0 | 69.01 | 37.78 | c716cf2b5b86fb24257cffe9e7969df8 | 78048 | cuiaba | MT | 49d2e2460386273b195e7e59b43587c3 | 180.0 | 3.0 | 750.0 | 26.0 | 15.0 | 26.0 | toys |
| 99434 | aa04ef5214580b06b10e2a378300db44 | f01a6bfcc730456317e4081fe0c9940e | delivered | 2017-01-27 00:30:03 | 2017-01-27 01:05:25 | 2017-01-30 11:40:16 | 2017-02-07 13:15:25 | 2017-03-17 00:00:00 | 5.0 | 0.0 | 0.0 | 2017-02-11 12:37:36 | credit_card,voucher | 2.0 | 5.0 | 389.43 | 9fc063fd34fed29ccc57b7f8e8d03388 | 1.0 | 370.00 | 19.43 | e03dbdf5e56c96b106d8115ac336f47f | 35502 | divinopolis | MG | 9fc063fd34fed29ccc57b7f8e8d03388 | 657.0 | 1.0 | 750.0 | 38.0 | 12.0 | 25.0 | health_beauty |
| 99435 | 880675dff2150932f1601e1c07eadeeb | 47cd45a6ac7b9fb16537df2ccffeb5ac | delivered | 2017-02-23 09:05:12 | 2017-02-23 09:15:11 | 2017-03-01 10:22:52 | 2017-03-06 11:08:08 | 2017-03-22 00:00:00 | 5.0 | 0.0 | 0.0 | 2017-03-11 15:42:41 | credit_card | 1.0 | 3.0 | 155.99 | ea73128566d1b082e5101ce46f8107c7 | 1.0 | 139.90 | 16.09 | 831ce3f1bacbd424fc4e38fbd4d66d29 | 5127 | sao paulo | SP | ea73128566d1b082e5101ce46f8107c7 | 254.0 | 2.0 | 2500.0 | 49.0 | 13.0 | 41.0 | furniture_decor |
| 99436 | 9c5dedf39a927c1b2549525ed64a053c | 39bd1228ee8140590ac3aca26f2dfe00 | delivered | 2017-03-09 09:54:05 | 2017-03-09 09:54:05 | 2017-03-10 11:18:03 | 2017-03-17 15:08:01 | 2017-03-28 00:00:00 | 5.0 | 0.0 | 0.0 | 2017-03-23 11:02:08 | credit_card | 1.0 | 3.0 | 85.08 | ac35486adb7b02598c182c2ff2e05254 | 1.0 | 72.00 | 13.08 | 6359f309b166b0196dbf7ad2ac62bb5a | 12209 | sao jose dos campos | SP | ac35486adb7b02598c182c2ff2e05254 | 1517.0 | 1.0 | 1175.0 | 22.0 | 13.0 | 18.0 | health_beauty |
| 99437 | 63943bddc261676b46f01ca7ac2f7bd8 | 1fca14ff2861355f6e5f14306ff977a7 | delivered | 2018-02-06 12:58:58 | 2018-02-06 13:10:37 | 2018-02-07 23:22:42 | 2018-02-28 17:37:56 | 2018-03-02 00:00:00 | 4.0 | 0.0 | 44.0 | 2018-03-02 17:50:01 | credit_card | 1.0 | 3.0 | 195.00 | f1d4ce8c6dd66c47bbaa8c6781c2a923 | 1.0 | 174.90 | 20.10 | da62f9e57a76d978d02ab5362c509660 | 11722 | praia grande | SP | f1d4ce8c6dd66c47bbaa8c6781c2a923 | 828.0 | 4.0 | 4950.0 | 40.0 | 10.0 | 40.0 | baby |
| 99438 | 83c1379a015df1e13d02aae0204711ab | 1aa71eb042121263aafbe80c1b562c9c | delivered | 2017-08-27 14:46:43 | 2017-08-27 15:04:16 | 2017-08-28 20:52:26 | 2017-09-21 11:24:17 | 2017-09-27 00:00:00 | 5.0 | 0.0 | 28.0 | 2017-09-22 23:10:57 | credit_card | 1.0 | 5.0 | 271.01 | b80910977a37536adeddd63663f916ad | 1.0 | 205.99 | 65.02 | 737520a9aad80b3fbbdad19b66b37b30 | 45920 | nova vicosa | BA | b80910977a37536adeddd63663f916ad | 500.0 | 2.0 | 13300.0 | 32.0 | 90.0 | 22.0 | home_appliances_2 |
| 99439 | 11c177c8e97725db2631073c19f07b62 | b331b74b18dc79bcdf6532d51e1637c1 | delivered | 2018-01-08 21:28:27 | 2018-01-08 21:36:21 | 2018-01-12 15:35:03 | 2018-01-25 23:32:54 | 2018-02-15 00:00:00 | 2.0 | 0.0 | 53.0 | 2018-01-27 09:16:56 | credit_card | 1.0 | 4.0 | 441.16 | d1c427060a0f73f6b889a5c7c61f2ac4 | 2.0 | 359.98 | 81.18 | 5097a5312c8b157bb7be58ae360ef43c | 28685 | japuiba | RJ | d1c427060a0f73f6b889a5c7c61f2ac4 | 1893.0 | 1.0 | 6550.0 | 20.0 | 20.0 | 20.0 | computers_accessories |
| 99440 | 66dea50a8b16d9b4dee7af250b4be1a5 | edb027a75a1449115f6b43211ae02a24 | delivered | 2018-03-08 20:57:30 | 2018-03-09 11:20:28 | 2018-03-09 22:11:59 | 2018-03-16 13:08:30 | 2018-04-03 00:00:00 | 5.0 | 0.0 | 0.0 | 2018-03-17 16:33:31 | debit_card | 1.0 | 1.0 | 86.86 | 006619bbed68b000c8ba3f8725d5409e | 1.0 | 68.50 | 18.36 | 60350aa974b26ff12caad89e55993bd6 | 83750 | lapa | PR | 006619bbed68b000c8ba3f8725d5409e | 569.0 | 1.0 | 150.0 | 16.0 | 7.0 | 15.0 | health_beauty |